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XI — Integration and Fourier Transform 


$1. The Upper Integral of a Positive Function — § 2. L? Spaces — 
$3. Measurable Sets and Functions — §4. Lebesgue-Fubini’s 
Way — §5. The Lebesgue-Nikodym Theorem — §6. Spectral De- 
compositions in a Hilbert Space — §7. The Commutative Fourier 
Transform — §8. Unitary Representations of Locally Compact 
Groups 


Bourbaki has sometimes been blamed for giving priority to function in- 
tegrals instead of first defining measures on sets like everyone and like Borel 
and Lebesgue. From a historical viewpoint, it should be borne in mind that 
since no later than Leibniz, an integral is, strictly speaking, not the measure 
of a surface area: an area is always positive, whereas in all classical definitions 
of an integral, areas above the x-axis are counted positively while those below 
the z-axis are counted negatively. Why? Naturally, in order to ensure that 


the linearity formula 
forssg-af sre fg 


holds, a formula made obvious by Leibniz’s notation and without which the 
whole machinery of Calculus would fall apart. In fact, from Newton to Borel 
and Lebesgue, everyone, even Cauchy and Dirichlet, integrated functions as- 
suming this does not present any problems, and from Riemann to our times, 
endeavouring to extend the definition to more and more general functions. 
Before Cauchy, integration was considered either as the inverse operation of 
differentiation, especially as this is our only chance of computing an integral 
explicitly, or in Euler and Fourier’s work, as a method for proving countless 
identities, for example by integrating series of functions term by term — a 
central problem in Lebesgue theory. By the time we get to Riemann, the 
area alluded to had virtually disappeared from the definition of the integral, 
becoming merely an intuitive geometric interpretation of the latter. 

Soon afterwards, others — Cantor, Peano, Jordan — attempted to define 
the measure m(A) of a subset A of the line or the plane by approximating it 
by finite unions of intervals or slabs. They thus obtained a category of sets 
for which the measure of a finite union of pairwise disjoint sets is the sum of 
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the measures of the individual sets. Definitions of multiple integrals extended 
to simple sets could thereby be made more rigorous, but over the line, stay 
within the framework established by Riemann. The first decisive step is due 
to Borel who, in 1898, defined the measure of an open subset of the line 
to be the sum of the series of the lengths of its constituent intervals (con- 
nected components), introduced “Borel” sets and assigned them a measure 
(the infimum of the measures over all open sets containing them). With this 
definition, not only the measure of a finite union but of a countable union 
of pairwise disjoint sets is the sum of the measures of the individual sets. In 
his thesis in 1902, Lebesgue, who was primarily trying to extend the classical 
relation between integration and differentiation to very general functions, ex- 
panded the definition to measurable sets, i.e. to unions of a Borel set and of a 
set of measure zero. This decisive step enabled him to obtain limit theorems, 
now named after him, which he presented in a famous book in 1905. 

In 1894, Stieltjes had defined the notion of a far more general integral over 
the line — a notion, however, not any “deeper” than Riemann’s (Chap. V, 
n° 32) as it applies to the same functions — from which the notion of surface 
area had totally disappeared. It was founded on an easy generalization of the 
measure of an interval; any physicist familiar with mass distribution could 
have invented it. In 1909, Frédéric Riesz proved that any continuous linear 
functional on the space of continuous functions on a compact interval K is the 
same as a Stieltjes integral; this time, the function integral is at the centre 
of the construction. Riesz’s theorem together with the equally fundamental 
Riesz-Fischer theorem (1907) established the theory in the almost algebraic 
framework of linear functional analysis; it is even one of its main starting 
points. The rest of the story is simply a matter of generalizations to the space 
R” (Radon, 1913) or to “abstract” measures, i.e. defined on sets without a 
topology (Fréchet 1915 and especially Caratheodory in a 1918 book). Around 
the same period, W. H. Young (1911) and P. J. Daniell (1918) showed how to 
define, in the framework of the usual measure, Lebesgue integrable functions 
from the integral of continuous functions without resorting to measures on 
sets. Their method did not meet with success before being systematically 
developed by N. Bourbaki in his book on this topic. 

When he came up with his theory, Lebesgue had no idea of what a func- 
tional space or a vector space was. These concepts progressively appeared 
during the following years, when the theory of integral equations and Hilbert 
spaces, on the one hand, and “abstract” and “modern” algebra, on the 
other, were developed. Lebesgue was too focused on “fine” analysis argu- 
ments about functions of real variables and trigonometric series to convert to 
the new fields that were being developed before his very eyes. This is also true 
of many of those who presented the subject before Bourbaki, a polycephalous 
author familiar with as much algebra as analysis. 

Therefore, those who blamed Bourbaki for having discarded the point of 
view of Borel, Lebesgue and others just because it is more natural (why?) to 
try and measure sets than to integrate functions, are mistaken, all the more 
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so as in general, with Bourbaki, theorems related to the measure of sets, that 
were obviously not forgotten, follow directly from properties of integrals. 
Besides, all that is needed is opening randomly some analysis books and 
articles to see that, with few exceptions, they contain countless integrals and 
very little measures of sets. There are no sacred texts in mathematics and 
every day theories are developed using methods radically different from those 
of their creators without anyone protesting. 

The most serious objections to the point of view presented here came from 
probabilists who, following Kolmogorov (1933), have interpreted a probability 
as a measure on a set without a topology. This point of view, developed in the 
first drafts of N. Bourbaki’s book, had led to a presentation, the Diplodocus, 
whose complexity and ugliness ended up disgusting all members of the group. 
I joined it right when the decision was taken to stick to the view subsequently 
published. If I remember correctly I was instructed to prepare the first de- 
tailed drafts of chapters III and V and the two first ones of chapter IV in this 
new perspective. So I am in a fairly good position to comparer both methods. 
Moreover in general, almost all authors taking the “abstract” view refrain — 
prudently? — from dealing with questions studied in detail by N. Bourbaki: 
measures defined by a density, image of a measure, integration of measures, 
quotient measures, etc. 

In fact, Radon measures are enough for almost all purposes in Analysis 
and in most cases, are better adapted than “abstract” measures. As used to 
say not so long ago a specialist of partial differential equations, « all I ask of 
integration theory is to provide me with Banach spaces» . The definition of 
a measure as a linear functional on continuous functions is perfectly adapted 
to this point of view since Bourbaki’s goal is the construction of DL? Ba- 
nach spaces. By contrast, Rudin’s book, Real and Complex Analysis (1966), 
which does not refer to those of N. Bourbaki though they were published 
ten years earlier, is convincing in this respect: after the “compulsory” chap- 
ter on abstract measures, its author is forced to make the connection with 
the topological viewpoint and prove F. Riesz’s theorem. Thus he is forced 
to start by defining the measure of an open set using a method which, in 
N. Bourbaki, defines more easily (because it is linear) the integral of any lower 
semi-continuous function; after which abstract measures disappear when he 
applies the theory to analytic functions and Fourier transforms. The book 
by P. Malliavin and H. Airault, Intégration, analyse de Fourier, probabilités, 
analyse gaussienne (Masson, 1994) or Integration and Probability (Springer, 
1995), also starts with a presentation about abstract measures, followed by 
an equally long chapter on Radon measures, and then by an excellent chap- 
ter on Fourier transforms where abstract measures are not involved. The 
latter only occur (almost always together with Radon measures) in the two 
chapters on probability theory. Not only can their usefulness be seen in this 
framework, but also the conceptual difficulties and the complicated notations 
resulting from the simultaneous consideration of measures defined on varying 
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“tribes” — all this in a textbook meant for students who probably have not 
yet mastered the fundamental classical theorems. 

Finally, from a pedagogical viewpoint, it is sufficiently clear that someone 
who has taken in N. Bourbaki’s viewpoint will not have any serious difficulty 
in understanding, if necessary, abstract measures and more recent develop- 
ments beyond the framework of locally compact spaces. 


Therefore, like Dieudonné’s presentation in his Eléments d’analyse, my 
presentation closely follows that of N. Bourbaki, probably for the same rea- 
sons as him: we have both spent quite some time developing it, and have not 
yet come across anything better or more complete if we keep to the frame- 
work of locally compact spaces, and at the very least, this framework covers 
95% of the needs of analysts. 

We warn the reader learning about this topic that this chapter includes 
several parts of unequal importance. Lebesgue’s fundamental theorems, with- 
out which nothing can be accomplished, are the subject matter of §81 to 3 
and of the beginning (Lebesgue-Fubini) of § 4. The rest of § 4 shows how using 
Lebesgue-Fubini’s arguments, generalized to “continuous sums” of measures, 
it is possible to unify apparently dissimilar developments: integration with 
respect to the image of a measure, or with respect to a measure defined by 
a density, group invariant measures and quotient measures, etc. §5 charac- 
terizes integrable functions with respect to a measure defined by a density, a 
result that could have been inserted in § 4, and proves the Lebesgue-Nikodym 
theorem. Although §§ 4 and 5 contain important results, mainly theorem 25 
about measures defined by a density (constantly used, usually without real- 
izing it as it seems so natural...) and the Lebesgue-Nikodym theorem, these 
8§ are less universally useful than §§ 1 to 3. Like the countless exercises found 
in Dieudonné and N. Bourbaki, they will help the reader to understand these 
fundamental theorems since it is in general hard to master them without us- 
ing them again and again: as Niels Bohr once said about quantum physics, 
in this field, « the experts are those who have already made all possible mis- 
takes ». He should have added: once and only once, but in his case this was 
self-evident... 

The same can be said about §§6 and 7 on spectral theory and Fourier 
transforms on commutative groups: these topics are perfect illustrations of 
the previous §§ (and as far as I am concerned, mutually influenced each other 
at the end of the 1940s). §7 is not a mere gratuitous generalization of clas- 
sical Fourier transforms: John Tate’s thesis in the 1950s on zeta functions of 
algebraic number fields, where these results are first used, also gives the first 
proof of possible simplifications and clarifications of classical results intro- 
duced by general Fourier transforms and the “adelic” viewpoint. The latter 
was later extended to noncommutative groups in the theory of automorphic 
functions. Finally, § 8 is an introduction to “integral” methods applicable to 
general locally compact groups. 
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§ 1. The Upper Integral of a Positive Function 


1 — Integral of an Isc Function 


Let X be a locally compact space, i.e. a topological space satisfying the Haus- 
dorff separation axiom and in which all points have a compact neighbourhood. 
L(X) will denote the vector space of continuous functions f : X —> C with 
compact support.! It is a union of subspaces L(X,K) where, for any com- 
pact set K Cc X, L(X,K) denotes the set of f € L(X) vanishing outside 
K. Let Lp(X) (resp. Ig(X, K)) be the set of real-valued functions on these 
spaces and, likewise, L,(X) (resp. L4(X,K)) be the set of positive-valued 
functions. The subspaces L(X, kK) are complete with respect to the norm 


IIfllx = sup |f(x)| 


of uniform convergence on X, which we shall denote by || || when no ambi- 
guity is possible. However, L(X) itself may not be so if X is not compact. 

As shown for metrizable spaces in Chap. IX, n° 17 — but this assumption 
only helps to simplify the proof? —, there are enough functions in L(X) for 
the construction of partitions of unity: if A C X is compact and (U;)1<i<n a 
finite open cover of A, then there are f; € L4(X) such that }° fi(a) = 1 on 
A and whose supports are contained in the sets Uj. 


(i) Positive measures. A measure on X is a linear functional 
pe: L(X) 3 C 


whose restriction to each L(X, K) is continuous with respect to the topology 
of uniform convergence; this amounts to attributing to every compact set 
K CX aconstant M(K) such that 


fe L(X, K) = [nA s M(A)I FI. 


will be said to be real if (f) is real for f real, and to be positive if u(f) > 0 
for all f > 0. Examples of such measures were given in Chap. V, §9. If X is 
an oriented differential manifold and w a differential form of maximum degree 


! The support of a function is the smallest closed set outside which the function 
is zero. 
See N. Bourbaki, Topologie générale, Chap. X, in particular § 4 on normal spaces, 
i.e separated and satisfying the following condition: if A and B are two disjoint 
closed sets, there exist open disjoint sets U D> A and V D B. As shown by 
Urysohn, this condition means that there is a continuous map from X to [0,1] 
equal to 1 on A and 0 on B, or that any continuous map from a closed set FC X 
to [0,1] extends to X. A locally compact space is not necessarily normal, but 
every compact subspace of X is. Thus, in this case, Urysohn’s theorem continues 
to hold if A is supposed to be compact and B to be the complement of a compact 
neighbourhood of A. 


2 


© Springer International Publishing Switzerland 2015 5 
R. Godement, Analysis IV, Universitext, DOI 10.1007/978-3-319-16907-1_ 1 


6 XI — Integration and Fourier Transform 


on X, the support of the form fw is compact for all f € L(X). Thus, setting 
u(f) = Jy fw (Chap. IX, n° 17), we get a measure on X. 
As we will see, Leibniz’s notation 


wit) = ff Fle sdu(e), 


essential in applications, is hardly needed in most of the theory. 
For positive measures, continuity follows in fact from positivity thanks to 
the next lemmas: 


Lemma 1. Let js be a positive linear functional on L(X). Then 


(1.1) le(A)| < w(lfl) for all f € L(X). 


This is obvious if f is real since f = ft—f-, |f| =f*4+/f7.If f =gt+ih 
is complex-valued, multiplying f by a complex factor with absolute value 1, 
u(f) may be assumed to be real and > 0. Then p(k) = 0 and |u(f)| = u(f) = 
(9) S M(If]) since g < |g < |fl- 


Lemma 2. For any compact set K C X, there is a constant M(t) such 
that 


(1.2) MAS Mx()IIfll for all fe L(X, Kk). 


Indeed, there is a positive-valued continuous function p on X equal to 1 
on K and zero outside a compact set K’ > Kk. Then |f(x)| < || f||p(@) for all 
f € L(X, K), and so 


MAIS HF) Sw (lfllp) = flle@), 


qed. 

Every measure is obviously of the form yp’ + iv” where yw’ and p” are 
real. Though elementary, it is slightly less obvious that every real measure is 
the difference of positive measures. We will prove this in n° 17 (theorem 29) 
when we will need it: integration theory is mostly about positive measures, 
and in what follows, we will almost always refrain from mentioning that the 
measures considered are. 


(ii) Dini’s Theorem. Let us start with a locally compact space X and a 
(positive!) measure « on X. As already stated in Chap. V, the first step 
towards Lebesgue theory is to define (vy) for every lower semi-continuous 
function y with values in [0, +00]. 

We start by recalling (Chap. V, §2, n° 10) that an increasing philtre is 
a non-empty set ® of functions defined on X, with values in ]|—oo, +00] and 
satisfying the following condition: For all f,g € ®, there exists h € ® such 
that sup(f,g) < h. If there are finitely many functions f; € ®, then there 
exists g € ® such that g > f; for all 7. 
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Increasing philtres generalize increasing sequences of functions and en- 
ables us to generalize the notion of limit. Given a function I : 6 —> C and 
some a € C, we will write 


lim I(f) =a 
if, for any number r > 0, there is a function r’ € ® such that 
fe® & fer = |l(f)-al<r. 


The reader will easily be able to extend the elementary properties of limits to 
this general framework: the function f plays the role of the traditional index 
n € N and IJ(f) replaces u,, ; the fact that neither f < g nor g < f in general 
(ie. no “total” order), contrary to what happens in N, makes no difference. 
In practice, we mostly need to consider functions J with values in 
]—o0, +00] and increasing, i.e. such that f < g implies I(f) < I(g). Then 


lim I(f) = sup J(f), 
fee 


which is readily explained by the case of an increasing sequence, and by 
convention, the left hand side is defined using this relation when the right 
hand side is equal to +oo. If it is written a, then the following properties 
hold: 

(i) I(f) <a for all f € &, 

(ii) for all M < a, there exists r’ € @ such that I(r’) > M, in which case 


f>r’ implies M <I(f) <a. 


Choosing I(f) = f(a) for a given x € X, this give the definition of a 
function 


xt sup f(x) < +00, 
fE@ 


which is just the upper envelope of the f € ®, written sup(®). 


Lemma 3 (Dini’s Theorem). Let © C Lp(X) be an increasing philtre 
yp = sup(®). Suppose that y € L(X). Then 


(1.3) lim |p — fllx =0 and p(y) = limu(f) = sup u(f). 
®& 


This result says that @ converges uniformly on X if its limit is continuous 
and with compact support. 

The proof is the same as in Chap. V, n° 10. We first suppose that & C 
L,(X). Let r be anumber > 0 and K the support of y. As y(a) = sup f(z) is 
everywhere < +00, for alla € K, there exists f € ® such that f(a) > y(a)—-r. 
This relation continues to hold in an open neighbourhood V(a) of a since f 
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and y are continuous. As K is compact, there are finitely many points a; € Kk 
such that the set V(a;) cover K, as well as functions f; € ® that are upper 
bounds for y—r in V(a;). There is a function r’ > sup(f;) in @ since there are 
finitely many f;, and so y(a) —r < r’(a) < y(a) for all  € K and perforce 
for all f € ® such that r’ < f. As these functions f are positive-valued and 
bounded above by y, they all vanish outside kK. As a result, 


rs f= lly—filx <r, 
which proves the first relation (3). Moreover, 


uly) — wf’) = we — f') < Mk (wile - f!llx < Mk(u)r, 


proving the second one. 

If the functions f € &@ are not all positive, replace them with f — fp, where 
fo € @ is chosen once and for all, and only consider functions f > fo. The 
f — fo clearly form an increasing philtre converging to y — fo. The results 
concerning ©® readily follow. 


(iii) Integral of an Isc function. Given a function y : X —> ]—oo, +c0], we 
shall denote by Ling(y) the set of f € Dg(X) such that f < w. It is necessary 
to assume that v(x) > 0 outside a compact subset of X. Otherwise Ling (y) 
would be empty. This is an increasing philtre since f’, f” € Lins(y) => 
sup(f’, f’”) € Line(y). The functions y : X —>+ ]—oo, +00] for which 


(1.4) p(x) = sup f(z) 
Lint () 


for all « € X are characterized by the condition of being positive outside a 
compact set and lower semi-continuous (Isc): for alla € X and all M < y(a), 
y(«) > M in the neighbourhood of a. The condition is necessary since, by 
definition of a supremum, there exists f € Ling(y) such that f(x) > M for 
x = a, hence by continuity, also in the neighbourhood of a. So y(a) > M 
in this neighbourhood. Conversely, suppose that ylsc and positive outside a 
compact set A. As shown in Chap. V, n° 10, (vi) by generalizing the classic 
argument applied to continuous functions, y is bounded below over K ; if 
m is its minimum over K, then y(x) > m everywhere. Then there exists 
g € Lg(X) such that g < y (take g < 0 everywhere and < m over kK), and so 
p= 9+’ where y’ is Isc and positive. As Ling(y) is the set of g + f’ where 
f’ € Lint(y’), the proof reduces to showing that y’ is the upper envelope of 
functions f’ € Ling(y’). Suppose that y’(x) > M in an open neighbourhood 
V of a given point a. Clearly, there is some f € L,(X) zero outside V, 
bounded above by M in V and equal to M at a. Then f € Ling(y’), and the 
result follows.? 

3 In practice, an Isc function positive outside a compact set is the upper envelope 


of a countable family of continuous functions with compact support. This result 
for the moment unnecessary will be proved later [n° 11, (ii)]. 
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T (resp. Z,) will denote the set of Isc functions on X with values in 
]—o0, +00] and positive outside a compact set (resp. everywhere). The upper 
envelope of a finite or infinite family of functions f; € Z is also in Z. This 
is also the case of the lower envelope or of the sum of a finite family. The 
unordered sum (Chap. II, n° 15) of a series of positive lsc functions is also 
Isc since it is the upper envelope of its partial sums, all of which are lsc. 

For all y € Z, define the upper integral of y by 


(1.5) u(y) = sup p(f) < +00. 
Lint (¢) 


Clearly, u*(f) = u(f) for all f € Dp(X) and 


gs<ypSr(y) <r). 


Lemma 4. Let ® be an increasing philtre of continuous functions with com- 
pact support and ~ its upper envelope. Then 


(1.6) e(p) = sup u(f) = lim n(f). 
@ 


The first expression is obviously greater than the second one. So, it suffices 
to prove the reverse inequality. For any g € Ling(X), let 6, C Lp(X) be the 
set of functions of the form inf(f,g) where f € ®. Since, generally speaking, 


f > f' = inf(f,g) > inf(f’,9), 


@, is an increasing philtre; it converges simply to g. Indeed, for all x € X 
and M < (zx), there exists f € ® such that f(x) > M. As g(x) < y(a), 
we can take M = g(x) —r, where r > 0 is given. For f’ = inf(f,g) € 5, 
f'(«) > g(x) — r. The result follows. 

Lemma 3 applied to ®, then shows that 


u(g) = sup plinf(f,g)] < sup u(f) 
EP, @ 


since inf(f,g) < f. As the right hand side does not dependent on the function 
g € Lint(v), the supremum of all y(g), namely p*(y), is inferior to it, proving 
the lemma. 


Theorem 1. The function p> p*(y), yp € TZ, has the following properties: 
(i) Additivity: 


Ee +y) = HY) +H); 
(ii) Passage to the limit for increasing sequences: 


iL” (sup Yn) = sup L* (Yn) , 
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and more generally 


we (sup ¢) = sup 1" (p) 
@ @ 
for any increasing filtering set 6 C T; 
(iii) Term by term integration of all positive function series: 


#* (So en) = Doe" (en) < +00. 


The proofs are the same as in Chap. V, n° 10 and 11. To prove (i), 
choose two increasing philtres ® and W in L,(X) converging to y and w and 
note that the functions f + g, with f € ® and g € W, form an increasing 
philtre converging to y+. It then remains to apply lemma 4. To prove (ii), 
consider the set Y = LU Lins(y) of functions bounded above by at least one 
of the y € ®. That it is an increasing philtre is immediate. It converges to 
w = supy since, for all x € X and M < w(x), there exists y € & such that 
p(x) > M, hence also some f € Ling(y) C W such that f(x) > M. This done, 
lemma 4 shows that 


u°(%) = supp(f) = sup sup p(f) = sup p*(~) 
wv ® Line (¥) ® 
[associativity of suprema: Chap. II, (9.9)]. Finally, (iii) is obtained by applying 
(ii) to the partial sums of the series. 
Clearly, u*(ty) = tu*(y) for all scalars t > 0, agreeing, as will always be 
the case, that 


a.+oo=+00 if 0<a<+oo, 0.4w=0. 


Like in Chap. V, n° 10 and 11, an entirely similar theory could be constructed 
for upper semi-continuous functions admitting functions of Dp(X) as upper 
bounds. It reduces to the previous case since a function f is usc if and only 
if —f is lsc. 


2 — Upper Integral of a Positive Function. 
Null Sets, Reasonable Sets 


(i) Upper integrals. Let us begin with a function f on X taking values in 
(0, +00]. There are functions y € Z such that y > f, for example the function 
which is everywhere equal to +oo. Then define the upper integral of f by 
setting 


(2.1) u'(f) = inf y*(y), 


so that y*(f) € [0, +00]. 
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Despite the adopted notation — I will also sometimes write 
wf) = f Hedy (e) 


—, this definition is different’ from the one given in Chap. V, n° 1, which at 
best has only the pedagogical merit of being simple. 

If y is lsc, definition (1) clearly provides the same value as (1.5). Another 
trivial but constantly used point is that 


(2.2) f<g=u(f) <u"). 


Also, 


w(tf) =tu"(f) for t 20 


provided that here too, the calculation rules for the symbol +o set out above 
are respected. 
In particular, we define the outer measure of a set A C X by setting 


(2.3) w"(A) = we (xa) 5 
which is the upper integral of the characteristic function of A. Then 
(2.3’) w'(A) = inf u"(U), 


where U varies in the set of all open subsets containing A. As the functions 
xu are lsc and upper bounds of x4, the left hand side is less than the right 
hand one. For the reverse inequality, there is nothing to show if u*(A) = 
+oo; otherwise, for all r > 0, there is a function Iscy > x4 such that 
(py) < p*(A) +1; for the open set U = {y(x) > 1}, xu < y and so 
p*(U) < p*(A) +r. (8’) follows since A Cc U. 

The outer measure of every compact set K C X is clearly finite: indeed, 
there is some f € L4(X) which is > 1 on K, and so y*(K) < p(f) by (2). 

As already seen in Chap. V, n° 11 in the case of open sets, most statements 
related to the measure of sets are obtained more or less trivially by applying 
the statements valid for arbitrary functions to their characteristic functions. 

Non-trivial properties of y*(f) proved for lsc functions (theorem 1) can 
be partly extended to the general case: 


4 Tf X is a compact interval in R, then as was seen in n° 2 of Chap. V, in Riemann 
theory, the upper integral of the function equal to 1 on X NQ and 0 everywhere 
else is equal to 1. In Lebesgue theory, it is equal to 0, as we will see later for any 
countable set. 
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Theorem 2. The function p* has the following properties: 
(i) Countable convexity: 


(2.4) #* (So fa) Soo" Un) S +00 


for all series of functions with values in [0, +00] ; 
(ii) Passage to the limit for increasing sequences: 


(2.5) u* (lim fy.) = lim pw" (f,) < +00. 


To prove (4), it suffices to show that 


(2.6) wef +9) < we (f) +29) 


for all f and g, and then to apply (5) to the partial sums of the series f,,. 

(6) is obvious if the right hand side is infinite. Otherwise, for all r > 0, 
there are Isc functions y > f and w > g such that u*(y) < p*(f) +r and 
u*(b) < w*(g) +7. Thus 


M(ftg<ewMetwW=K(e) +H) swf) +H" (g) +27, 


which leads to (6). 

To prove (ii), first note that, by (2), the right hand side of (5) is less 
than the left hand one. Hence it suffices to show the reverse inequality. This 
requires a proof only if the right hand side is finite. 

Let us take a sequence of numbers r, > 0 and, for all n, consider a Isc 
function w, satisfying 


Un 2 fn and pe" (Yn) SM" (fn) +7 n- 
Then setting 
Qn = sup (t1,...,Un), whence y, > sup(fi,---,fn) = fn 
Pn+1 = SUP (Yn; Yn41). So 
Yn+1 + inf (Pr, Pnt1) = Pn + Yn4i - 
As a result (theorem 1), 
He” (Pn4i) +H" [inf (Gn, Pnti)] =H (Gn) +H (bnt1) - 


Since inf (Yn, Wn4i1) > inf (fn, fn4i) = fn, the left hand side is greater than 
B* (Pn41) + L* (fr), and so 
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Adding the first n — 1 relations sidewise, it follows that 


fe (a) > Ua) S Piet ee BG 


since Y1 = w. Choosing r, = r/2” for a given r > 0 thus gives an increasing 
sequence of Isc functions y, > fr satisfying 


EL (Gn) S we (fa) +7 


for all n. f = sup(fn) is bounded above by the upper envelope ¢ of all yy, 
and (theorem 1) 


Lf) S w*(y) = sup 2" (on) S sup Hw" (fn) +1, 
qed. 
Corollary. For any countable family of sets An C X, 


(2.4’ wt (Un) < Sou" (An) 


and for any increasing sequence, 
(2.5’) uw (U An) = lim p* (An) . 


These results must not be applied to arbitrary increasing philtres of posi- 
tive functions; countability is essential here. 


(ii) Null sets. Let us now introduce the basic notion of a measure-zero-set 
or null set, i.e. such that 


(2.7) y*(N) = pt (xx) =0. 


By (3’), this is equivalent to the requirement that, for all r > 0, there exists 
U such that 


(2.7°) NcU & pU)<r. 


Theorem 3. All subsets of a null set are null; the finite or countable union 
of a family of null sets is null. 


Obvious. 

With respect to the usual Lebesgue measure on R, a set reduced to a 
unique point is obviously null. Hence this is also the case of all countable 
sets, for instance Q. The converse is, however, false. The most well-known 


> Do not assume that if the sets Ay were pairwise disjoint, we would get an equality: 
as we will see further down, for this the sets A, need to be assumed to be 
measurable. 


14 XI — Integration and Fourier Transform 


counterexample is the Cantor triadic set of numbers « € K = [0,1] which do 
not contain the digit 1 in their base-3 expansion. It can be easily shown (n° 6) 
to have measure zero (fanciful proof: the probability that the expansion of a 
randomly chosen number does not contain the digit 1 is zero). It is equipotent 
to R: the map f from the Cantor set to [0, 1] associating the binary number 
f(z) = O, a4 a5..., with 7) = a; if x = 0 and wv, = 1 if x; = 2, to every 
x = 0,21, %2,... whose base-3 expansion contains only Os and 2s is surjective, 
which suffices. 

When a relation P{x} depends on one variable  € X, P{x} is said 
to be true almost everywhere (ae.) if the set of points x for which P{z} is 
not true has measure zero: with respect to the Lebesgue measure, almost 
every real number is irrational and even transcendental because the set of 
algebraic equations with rational coefficients is countable, and hence so is 
the set of their roots. If, taken separately, each one of a finite or countable 
family of propositions P,,{x} is true almost everywhere then, by theorem 3, 
they are all simultaneously true almost everywhere. For example, the sum of 
a series of functions zero almost everywhere is zero almost everywhere. The 
set of complex-valued null functions, i.e. zero almost everywhere, is clearly 
a vector space V(X; 4) = NV. Despite the terminology, supposedly null sets 
and functions are far from being negligible. They actually serve to cover up 
horrors that are useless to consider when calculating integrals because they 
do not occur in the calculations. 


(iii) Reasonable sets and functions. Let us start by prove the following 
result: 


Theorem 4. Let f be a function with values in [0,+00] ; then 


(2.8) p(f) =0 <=> f(x) =0 ae., 
(2.9) L*(f) < +00 => f(x) < +00 ae. 


Any function such that p* (|f|) < +oo vanishes outside a countable union of 
sets of finite outer measure. 


Supposing that f > 0 and y*(f) = 0, let us consider the set N, of x where 
f(x) > 1/p and its characteristic function x». Then f > xp»/p, whence 


(Np) = 2 (Xp) S pe" (f) = 0. 


Being the union of the sets N,, the set N = {f(x) > 0} has measure zero. 
To prove the converse, choose N, = {p< f(x) <p+1}. This time fy, < 
(p + 1)Xp. So u* (fxp) < (P+ 1)u* (Xp) = 0. As f = Dison, w* (If) = 0 
by theorem 2, leading to (8). 

To prove (9), we set A, = {f(x) > p} and again let x, be the character- 
istic function of A,. Then f > px», whence p*(vp) < u*(f)/p. As the set 
N = {f(x) = +00} in the intersection of the sets Ap, w*(N) < p*(f)/p for 
all p, giving (9). 
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Finally, if u* (|f|) < +00, there is a lsc function y > |f| such that p*(y) < 
+oo. So it suffices to prove the third proposition of the statement for y. Now 
the set {y # 0} is the union of the open sets U,, = {y(x) > 1/n} and each one 
of them has finite outer measure since its characteristic function is bounded 
by y, up to a factor n, qed. 

To simplify the language and at the risk of introducing unorthodox termi- 
nology, I will say that a set FE C X is reasonable if it is contained in the union 
of a countable family of sets of finite outer measure, and that a function f 
is reasonable if so is the set {f 4 0}. By the previous theorem, that is the 
case if pu* (|f|) < +00. Clearly, every subset of a reasonable set is reasonable, 
and so is the countable union of reasonable sets. Finally, the measure y itself 
will be said to be reasonable if X is the union of a countable family of sets 
of finite outer measure, i.e. is reasonable, in which case all sets and functions 
are reasonable with respect to wu. 

Since the outer measure of every compact set is finite, any set E Cc X 
contained in the union of a countable family of compact sets and of a measure- 
zero-set is reasonable. The converse will be proved later. If the space X is 
countable at infinity, i.e. the union of a countable family of compact sets, 
then all subsets of X as well as all measures on X are reasonable. Due to this 
remark, these notions lose much of their importance since in practice, spaces 
over which integration is done are almost always countable at infinity. But 
introducing assumptions of countability at infinity for X or the measure pu 
would not affect the validity of the fundamental theorems of § 3 nor simplify 
their proofs. These assumptions will be useful, even essential, § 4 onwards. 

Exercise. Let X be a discrete set: all subsets of X are open, the compact 
subsets are the finite subsets of X, all functions are continuous. Choose the 
measure p(f) = >> f(x). Show that a subset of X has finite outer measure if 
and only if it is finite, and is reasonable if and only if it is countable. 


3 — FP Spaces 


(i) Definition of F? spaces . For a complex-valued function f, we set 


(3.1) Ni (f) = 2" (If) < +00 
and more generally 
(3.1) N,(f) = wt (\f/P)'/? 


for any real number p > 1. By far the two most important cases are p = 1 
and p = 2, but to deal with them simultaneously as will be done here, one 
might as well consider the general case: it is not harder and avoids unnec- 
essary repetitions imposed on their readers by the vast majority of authors. 
Moreover, general L? spaces occur in many applications of the theory. Like 
N. Bourbaki in Intégration, Chap. II, the case of functions with values in 
a Banach space H could be dealt with at the same time without changing 
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notations: it would suffice to denote the norm of a vector a by |a|, which 
is not forbidden by higher authorities, and the function x +> |f(«)| by |/, 
and to agree that what we denote by L(X) is the set of continuous functions 
with compact support and values in H; at least up to the Lebesgue-Fubini 
theorem, results and proofs will be exactly the same as those for complex- 
valued functions; this the reader can check step by step.© The only additional 
difficulty lies in the definition of the integral of a vector-valued function; we 
will do this in the next n°. 


For p > 1, the only case considered, the function N, has quite simple 
properties: 


(3.2) Np(af) = lol Np(f) 

for all scalars a. It is equally clear that 

(3.3) If] < lg] => Np(f) S Np(g)- 

Last but not least, Minkowski’s inequality 

(3.4) No(f +9) < Np(f) + Np(g) 

and Holder’s inequality 

(3.4) Nr(f9) S Np(f)Na(g) if I/p+1/q = 1/r 


of Chap. V, n° 14 hold in this general context because they are only based on 
the formal properties (2.3), (2.4) and (2.5) of the function p*(f) for f > 0. 
Inequality (4) is trivial for p = 1 — it suffices to write that |f+g9| < |f|+ |g|- 
and (4’) is mostly useful for r = 1 in the form 


(3.4”) le" (f9)| < No(f)Na(g) if 1/p+1/q=1. 


This assumes p > 1 and q > 1 since p and q must be finite. 

The set F?(X; 1) = F? of complex functions such that N,(f) < +00 is, 
therefore, a vector space on which the function Np is a norm, except for one 
detail: by (2.8), 


(3.5) N,(f) =0 is equivalent to f(x) =0 ae. 


and not to f = 0. If two functions f and g are equal almost everywhere, then, 
by (4) and (5), Np(f) < Np(g) + Np(f — 9) = Np(g). Thus symmetry implies 


(3.6) f =g ae. => N,(f) = Np(g)- 


° The case of functions with values in a Banach space is not an unwarranted 
generalization of the classical theory. It is an essential part of the representation 
theory of locally compact groups (n° 19) and of many other questions. This does 
not force the reader to be concerned with it before needing to do so. 


§1. The Upper Integral of a Positive Function 17 


Hence the number N,(f) does not depend on the class of f modulo the 
subspace NV of null functions; denoting — very provisionally — this class by f, 
we set 


(3.7) IIfllp = Np(f). 


Relations (2) and (4) hold for these classes, and as all has been done so that 
\|f |p = 0 implies f = 0, the quotient space 


F(X; ) = FP = F(X; w)/N(X; pn) 


becomes a genuine normed vector space. Convergence in this space, defined 
by 


lim N,(f — fn) =0, 


is called convergence in mean of order p; instead of writing 


tim f | f(0) ~ fala)? d(x) = 0, 


which for explicitly given functions can be quite cumbersome, it is sometimes 
convenient to use the expression 


f(x) = lim.” fr(x) 
(limit in mean). 
(ii) Convergence in mean and almost everywhere. Proving relations be- 
tween convergence in mean and simple convergence or, more generally, al- 


most everywhere, is one of the basic features of the theory, as the next result 
already shows. 


Theorem 5. Let > fn(x) be a series of complex-valued functions such that 
>= No(fn) < +00 for given p > 1. Then 


(3.8) > lfn(Z)| < +00 ae. 


and any function f such that 


(3.9) fe) = fala) ae. 


satisfies 
(3.10) Np(f) < 3) Np(fn) 
(3.11) f(a) =Lim.? fi(a)+...4+ fr(x). 


The normed vector space F? is complete, and any sequence of functions 
fn € F” converging in mean to a limit f contains a subsequence converging 
almost everywhere to f. 
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Let us start with the case of a series > f, whose terms are > 0 and denote 
the partial sums by s,,. The increasing sequence of functions s? converges to 
f?, where for all x, we set f(x) = >° fn(x) < +00. Then, by theorem 2, 
p*(f?) = lim p*(s?,) and so, raising to the power of 1/p, 


Np(f) = lim Np (Sn) < lim [Np (fi) +--+ Np (fn)] S S Np (fn) » 


whence (10). As this implies N,(f) < +00, f(x) < +00 ae., and (8) follows. 

In the general case, apply the results obtained above to the functions 
|fn| : Since Np(fn) = Np (|fnl) (8) follows, and since (9) shows that |f| < 
>> |fn| ae., relation (10) for the functions |f,| implies (10) for functions fp. 
Relation (11) is obtained by removing the first n terms from the given series, 
which subtracts the partial sum s, of the series from f. Then (10) shows 
that 


(3.12) Np (f — $n) S > Np(fm) » 


m>n 


which leads to the result since the series }> N,( fn) converges. 
To show that F? is complete, let (f,) be a sequence of functions in F? 
and suppose that, for all r > 0, 


Np (fj — fi) <7 for large i and j. 


It is a matter of proving the existence of a function f € F? such that 
lim N,(f — fr) = 0. Like in any metric space, this means that it suffices 
to show that a subsequence converging in F? can be extracted from any 
given Cauchy sequence. 

By successively taking r = 1/2,1/27,..., a subsequence such that 


(3.13) Np (Fan a fin) = 1/2" 
can readily be extracted from the given sequence. Since the differences 


satisfy }° N,(gx) < +00, the series 5° g, converges absolutely almost every- 
where and in mean to a function g which, from the first part of the proof, 
is also the limit (almost everywhere and in mean) of the partial sums of the 
series. But (14) shows that 


As the left hand side tends to g(x) ae., one can deduce that 


lim fn, (@) = g(x) + fri (@) ae. 
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Denoting by f(x) the limit function — regardless of its value at points 
where it is defined —, by (15), 


9-GJ--— Ge =F - fregs ae.. 


So 


lim Np (f — frags) = 9 


since the series }> g, converges in mean to g. This leads to a subsequence 
converging to f both in F? and almost everywhere, qed. 

Exercise. Let (a,) be a sequence of real numbers (for example the sequence 
of rational numbers). Show that the series 7 1/|n?2—a,,|'/? converges almost 
everywhere in R. 

The previous theorem shows that F? is a Banach space (Appendix to 
Chap. III, n° 5). In practice, it is not very useful because the functions used 
to define it are far too general for other non-trivial properties apart from the 
previous theorems to be proved. Proper integration theory will be obtained 
by replacing F? with a far more practical closed space, namely the space of 
limits in mean of continuous functions with compact support. 
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§ 2. L? Spaces 
4 — Integrable Functions, L? Spaces 


(i) Integral of an integrable function. A complex-valued function f is said to 
be integrable if, for all r > 0, there is a continuous function g with compact 
support such that 


(4.1) Ni(f-g) =H (If—gl) Sr. 

To require the existence of a sequence of functions f, € L(X) such that 
(4.1’) lim Ni (f — fn) =0 

would be equivalent. The integral of f is then defined by writing 


(4.2) “(f) = lim w(fn)- 


This limit exists and does not depend on the chosen sequence (f,,). Indeed, 


lu fp) = LM fq)| = le fp = fa) | < E(|fp oa Fal) = Ni (fp = Fa) 
< Ni(fp — f) + Ni(f — fa); 
an arbitrarily small result when p and q are sufficiently large. Therefore, the 


sequence j(f;,) satisfies the usual Cauchy criterion and converges. If another 
sequence (g,,) of continuous functions satisfies (1’), then similarly 


e(fn) — L(9n)| < (fn — Gnl) a Ni(fn =a) < Ni(fn _ f) Ni —Ge)s 


whence lim (fn) — “(gn) = 0, qed. There is also a general theorem about 
the extension of a continuous linear functions. Its proof is similar... 

Any continuous function f € L(X) is integrable [take g = f in (1)] and 
its Lebesgue integral equals p(f). As the relation 


(4.3) MP) < Ni(f) 


holds for all continuous functions, by passing to the limit, it can be generalized 
to integrable functions. If f is integrable, so is’ |f| because of inequality 


fl —Igl| <|f—gl, and 
(4.3’) Ni(f) = (|f]) for f integrable. 


Indeed, the same inequality and (1’) show that | f,,| converges in mean to |, 
whence 


w([f|) = lim pe (|fn|) = Tim Ni (|fnl) = Mi (Ff) = i(f)- 


” Thus Lebesgue theory generalizes the absolutely convergent integrals of Chap. V, 
8 7. 


© Springer International Publishing Switzerland 2015 20 
R. Godement, Analysis IV, Universitext, DOI 10.1007/978-3-319-16907-1_2 


§ 2. L? Spaces 21 


On the other hand, it is clear that all functions equal ae. to an integrable 
function are integrable and that the integrals of these functions are equal. 


Lemma 1. If f and g are integrable, so is af + Bg for alla, GEC, and 
ulaf + Bg) = au(f) + Bulg) - 


This follows from the inequality 


(4.4) Mi [(f +9) — fn + gn)] S Ni (fF — fr) + Mi (f — fr) 


and from the linearity of the integral of continuous functions. 


It was stated above that this theory can be generalized word for word 
to functions with values in a Banach space H. Nonetheless, the integral 
u(f) € H has to be defined for every integrable function f with values 
in H and, to begin with, for every continuous function with compact 
support. Though elementary, the construction requires several stages. 

(a) We first suppose that 


(*) f(x) = >> fi(a)ai 
with finitely many f; € L(X) and a; € H. We then naturally set 
Gas) w(E) = So wl fidas. 


If f(x) = >°g,;(x)b,; is another similar representation of f, then there 
are finitely many linearly independent elements cz satisfying relations® 
aj = D> ance, bj = D> Bjxe~, whence 


> fi(t)our = S93 (2) Bir for all k. 


Integrating, one can deduce that >> u(fi)air = >> w(9;) Gj" and so, mul- 
tiplying by c, and summing, that 


So ulfilar = >> (g;)b;. 


This shows that j(f) only depends on f, justifying definition (**). 
(b) As assumptions on f remain the same, inequality 


aia) |u(£)| < (IE) 


needs to be generalized to this case. Recall that absolute values denote 
the norm in H and |f| is the function x + |f(«)|. The quickest proof is 


8 Choose the elements cg so that they form a basis for the (finite-dimensional) 
vector subspace generated by the a; and the b;. 
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based on the following consequence of the Hahn-Banach theorem, which 
I will admit:° for all a € H, 


Cer) jal = sup |ip(a)| , 
where y varies in the set of continuous linear functionals having norm 
<1 on H. Now, for such a form, (**) shows that 


=> u(filela) = u SS fielas)| = p(yof), 


where yo f is the numerical function 


cr > yf = fila 
Since by assumption |y| < 1, |yof(zx)| < |f(x)|, and so 


le lw(E) < wee fl) < w (IE) - 


As this holds for every y having norm < 1, inequality (****) for a = yu(f) 
proves (***), 

(c) Now let f be a continuous function with values in H and vanishing 
outside a compact set K Cc X. For all r > 0, there is (BW) a finite open 
cover (U;) of K such that f is constant on each U;, up to r (uniform 
continuity...); and there exist f; € L4(X) zero outside U; and such 
that So fi(~) = 1 on K. So f(x) = YO fi(x)f(x) for all « € X. Let us 
choose some x; € U; and set a; = f(2;), g(x) = > fi(x)aj, whence 


f(x) — ee) =| fil) [f@) - ail] < A) [F@) - ail 


For any given x € X, the only terms that matter correspond to the indices 
i for which f;(z) # 0. But this implies x € U; and so |f(x) — aj| < r. 
Hence |y(x) — g(x)| < ro fi(z) = r. This shows that f is the uniform 
limit of functions of type (*) vanishing outside a fixed compact set. 


The framework for the present case is actually a finite-dimensional subspace of H, 


so that the Hahn-Banach theorem in finite dimension is sufficient. It is obvious 
(Cauchy-Schwarz) for a Hilbert norm, but less so in the general case. Rudin, 
Real and Complex Analysis, Chap. 5, gives a proof of a special version of the 
HB theorem. The proper statement, found in N. Bourbaki, Espaces vectoriels 
topologiques, is as follows: let C’ be a convex and closes set in a topological 
vector space H. whose topology is defined by seminorms (i.e. locally convex). C 
is then defined by relations Re [fi(a)] < 1, where the f; are continuous linear 
functionals on H. In other words, C' is an intersection of “closed real half- 
spaces”. In exercises 3 and 4 of his Chap. XII.15, Dieudonné also proves some 
particular cases, since he obviously does not wish to have to invoke transfinite 
induction, which in some form or other is essential in the general case (including 
for Banach spaces). The theory was finalized during the war by Dieudonné and 
independently by George W. Mackey. 
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(d) u(f) can now be defined by approximating it with functions f, 
of type (*) zero outside a fixed compact set: inequality (***) shows that 
(u(£,)) is a Cauchy sequence, and so converges to a limit u(f), which is 
obviously the same for all sequences (f,,). Inequality (***) readily gener- 
alizes to this case. 

(e) Finally, let f be an integrable function, i.e. such that there exist 
continuous functions f,, with compact support satisfying 


lim N,(f — f,) = lim p* (|f — f,|) = 0. 


As (f,,) is a Cauchy sequence with respect to the norm Nj, (***) again 
shows that their integrals converge to a limit depending only on f. This 
gives the definition of y(f) in the general case, inequality (***) remaining 
valid. 

Finally note the following linearity property: let f be an integrable 
function with values in a Banach space H and T a continuous linear map 
from H. to a Banach space H’. The function T of is then integrable and 


p(T of) =T [n(£)] 


or, in more standard notation, 


(Sona) [t8@).aue) = rf e(a)du(e). 


This is obvious for functions (*), and, given the continuity of T, the 
general case follows by passing to the limit. The previous relation is 
frequently used when T’: F'+> C is a continuous linear functional on F’. 


(ii) L? spaces; the Riesz-Fischer theorem. More generally, let us now 
define p'” power integrable functions for all p > 1 by requiring that for each 
such function f, there be functions f, € L(X) such that 


(4.5) lim N,(f — fn) =0. 


The set £?(X; 4) = L” of these functions is clearly a vector subspace of F?. 
Since N,(f) remains invariant if we modify f on a null set, only the class of 
f mod N occurs in f € L?. 

The quotient space 


TP(X; pu) = LP(X; w)/N(X; Hu), 


denoted simply by LZ? when no confusion is possible, is a normed vector 
space; it is the closure in F? of the set of classes mod N of continuous 
functions with compact support. Since F? is complete, so is L?, which is, 
therefore, a Banach space (and historically, probably the first of its kind 
apart from Hilbert spaces). Almost always, by abuse of language, one writes 
f € L? instead of f € L? and ||f||, instead of N,(f). Another convenient 
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convention consists in extending the previous definitions to functions only 
defined outside a null set or taking the value +o0 or —oo on such a set. 
Assigning to them arbitrary finite values on the null set in question reduces 
the definition to the previous one. The norm N, of the function thus modified 
and its class modulo NV only depend on the initial function. Hence hereinafter, 
an expression of type «let f be a function defined on X » may also mean 
a function defined on the complement of a null set. This convention may 
seem somewhat strange when encountered for the first time, but one quickly 
gets used to it because of its convenience, provided some minimal caution is 
exercised. 


Lemma 2. Let (f,) be a sequence of functions in L? and f a function such 
that lim N,(f — fn) =0. Then f is in L?. If p=1, then 


(4.6) w(f) = lim u(fn) - 


Obvious. 


Theorem 6. Let (f,,)be a sequence of functions in L? such that )> Np(fn) < 
+00. Then > |fn(x)| < +00 ae. Any function f satisfying f(x) = > fn(a) ae. 
is in L? and satisfies 


(4.7) lim Np (f — fi —--.— fn) =0, 
(4.8) wf) =oeUn) if =1. 


This is theorem 5 applied to functions in L?. By lemma 2, the (class of 
the) limit function f is still in L? and (8) follows from (4) and (7). 


Corollary. Let S> f(x) be a sequence of positive integrable functions. The 
sum of the series is integrable if and only if > f fn(x)du(x) < +00. The 
series then converges almost everywhere and in the space L', and 


fue) We =D f folate) 


It suffices to observe that for a positive integrable function f, Ni(f) = 
mF) = wf). 

Compare with theorems 20 and 21 of Chap. V, n° 23: uniform or nor- 
mal convergence has disappeared from assumptions and restricting oneself to 
regulated functions is no longer necessary. 


Theorem 7 (Riesz-Fischer). The normed vector space L” is complete. A 
sequence of functions f,(a) in L? converging in mean to a limit f contains a 
subsequence converging almost everywhere to f. If, moreover, the sequence 
fn(x) converges almost everywhere to a limit g(x), then f(x) = g(x) almost 
everywhere. 


§ 2. L? Spaces 25 


Being a closed subspace of a complete space, L? is complete. The second 
proposition of the statement is here too theorem 5 in the special case of L?. 

The appearance of “exceptional” null sets in these statements cannot 
be avoided: the behaviour everywhere of a sequence or of a series cannot be 
expected to follow from integral computations. Similarly, statistics provides 
no information on an individual case (the reason why Condorcet had already 
realized around 1750 that statistics are only useful for political leaders). It is 
already noticeable in Riemann theory, but in Lebesgue theory, it can be done 
only if one specifically tolerates null sets about which nothing can be said. 

In some very particular cases, classical analysis can reinforce integration 
theory. 

If for example X is an open subset in C, and if yu is the usual measure 
dady, then a sequence of holomorphic functions f, € L?(X,w) converging 
in LP converges uniformly in every compact set to a holomorphic function 
(Chap. VIII, n° 4, (iv)], which is necessarily their limit in L?. This proves 
that in the space L? considered, the set of holomorphic functions is a possibly 
trivial closed vector subspace. But it was obviously not in order to integrate 
such harmless functions that Lebesgue and his successors worked for half a 
century. And even in this idyllic case, compact convergence does not imply 
convergence in mean. 

Exercise 1. Suppose that X = [0,1], du(a) = dx and set 


f(x) = cos?” Innz where n = 2?. 


Calculate pu(f,) [observe that f, is a trigonometric polynomial whose constant 
term can be calculated by using Euler’s relation and the binomial formula]. 
Using Stirling’s formula show that f, converges in mean to 0. For which 
x € X does lim f,(x) = 0? 

As any function f € L? is by definition the limit in mean of continuous 
functions, theorem 7 shows the existence of a sequence of continuous functions 
with compact support that converges almost everywhere to f(a). The proof 
of theorem 5 even leads to a somewhat more precise though obvious result: 


Corollary 1. For any f € L?, there is a series of continuous functions with 
compact support such that 


UENp(fn) < +00, f(x) = )0 fn(a) ae., 
lim N,(f —fi—...—fn) =0. 
If f is real-valued, the f, may be assumed to be real. Writing y’(a) = 
Se fit (x), o’ (x2) = ¥3 fy, (x) defined Isc functions with values in [0, +00], but 


finite almost everywhere, since N,(y’) < >> Np(fst) < 32 Np(fn) < +00, and 
we get 


f(a) = ¢'(«) — 9"(@) ae., 


as well as u(f) = u(y’) — nly") ifp = 1. 
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This result shows that, for real-valued functions, Lebesgue theory would 
not go beyond the framework of Isc functions if null functions did not exists. 


Corollary 2. Every function belonging to the space L? (p < +00) is reason- 
able. 


It suffices to consider the supports K,, of the functions f,, of corollary 1 
and to note that f vanishes outside the union of these K,, and a null set. 
This result is of little interest when X is countable at infinity... 


Corollary 3. For all f € L”, there is a sequence of functions fr, € L(X) 
and a function F € L? such that 


f(x) = lim fn(a) ae., | fn(a)| < |F(a)| ae. 


If |f(a)| < M almost everywhere, we may assume that |fn(a)| <M for all x 
and n. 


To prove the first point, it suffices to consider the partial sums s,,(a) of 
the series whose existence is ensured by corollary 1 and to choose F(a) = 
>> |fn(x)|. By theorem 6 applied to the |f,|, F € L?. To prove the second 
point, consider the contraction p of the complex plane onto the disc |z| < WM 
given by 


p(z) = Mz/sup (|z|,M) . 


The functions g, = po f, are still in L(X), |gn(x)| < |F'(«)| still holds ae., 
and finally lim f,(2) = lim gn(«) since f(x) < M, qed. 

When, for a sequence of a priori arbitrary functions f,, there is a func- 
tion F’ such that 


N,(F)< +00 &  |fn(a)| <|F(a)| ae., 


the sequence (f;,) is said to be dominated in L?. This notion, almost always 
applied to sequences f,, € L?, plays a fundamental role in Lebesgue’s con- 
vergence theorems. In particular, we will see (theorem 9) that, under the 
assumptions of the corollary, the functions f,, converge to f in L? and not 
only almost everywhere. 


(iii) The case of lsc or usc functions. 


Lemma 3. A lsc or usc function yp is integrable if and only if Ni(yv) < +00. 


As —yg is lsc if y is usc, we need only consider the case when ¢ is Isc and 
show that condition Ni(y) < +00 is sufficient. As y is Isc, yt is Isc, y~ is 
usc and these two functions satisfy the condition of the lemma. 

Let us first consider yt. For all r > 0, there is a function f < yt in 
Lp(X) such that 


uf) Sw (y") = Me") Swf) +r. 
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As yt = f+(y* — f), and as f and yt — f are Isc since f is continuous, by 
property (i) of theorem 1, 


M(y-f)=K (le-f) =u (¢e—-f) =u" (¥) -— Hf) Sr, 


whence yt € L?. 

Next consider y~. Since p*(y~) = Ni(y~) < +00, there exists a Isc 
function w such that w@ > y7 and p*(y) < +00. The function w — y~ 
being Isc,!° positive and bounded above by w, u*(w — yp) < +00, so that, 
by the first part of the proof, ~ — y and w are integrable. As a result, 
pp. =wv-—(w— y_) is integrable, qed. 

Lemma 4. A function f with values in [—co, +00] is integrable if and only if, 
for allr > 0, there exist integrable functions y and, |sc and usc respectively, 
such that 


w<f<p & wWiy-YwW<r. 


If f is integrable, there is a function u € Lg(X) such that p*(|f—ul) <r, 
hence a Isc function @ such that 


If-ul <0, w(@)<r. 


As-0< f-—u<0,u-—0< f <u+6 since uw is finite-valued. Since u is 
continuous, u + @ = y is lsc, u— 6 = w is usc, and like u and 6 (lemma 3) 
these two functions are integrable. Seeing that 


Lu (p — v) = p"(20) < 2r 


the condition is necessary. 

Conversely, if it holds, the function y — w, which is lsc and positive, is 
integrable (lemma 3). As Ni(y — f) < u*(y— wv) < 1, the function f is 
the limit in mean of usc and integrable functions (put r = 1/n), hence is 
integrable, qed. 


5 — Lebesgue’s Theorems 


(i) The dominated convergence theorem. 


Lemma 1. The upper (resp. lower) envelope of a finite family of functions 
of L? is in L?. 

For real functions, inequality |f*—g*| < | f—g| shows that N,(f*—g*) < 
N,(f — 9). If f is in £L?, so are ft and f—. We conclude the proof using 
relations 


10 As v(x) > —oo everywhere, the function —y~ does not take the value —oo, and 
hence is in Z, and consequently so is wy — yp". 
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sup(f,g=f+(g—-f)', imf(f,g)=9-(9-f)t. 


Likewise, an analogous argument using inequality | fl-| || < |f — g| shows 
that 


fer} |fler. 
The converse is almost true (theorem 10). 


Theorem 8. Let (f,) be an increasing sequence of positive functions in 
L?. f = supf, = limf, is in L? if and only if sup Np(fn) < +00. Then 
lim Np(f — fn) = 0. 


The condition is clearly necessary since 0 < f, < f for all n. To obtain 
the converse, it suffices to show that the sequence (f,,) satisfies Cauchy’s 
criterion in L?. 

This is easy if p = 1. Indeed, since f; — f; is integrable and positive for 
i<j, 


Ni (f; — fi) = 1 (F3) — BH CSa) - 


As the sequence pu(f;,) is increasing and bounded above, Cauchy’s criterion 
follows readily. One can also apply the corollary of theorem 6 to the series 
“F n+1 ~~ i n)- 

In the general case, we first note that, if two positive functions f and g 
are continuous and with compact support, then 


(5.1) Np(f)? a N,(g)” < N,(f +g)? s 


To see this, it suffices to apply yz to the inequality f?+g? < (f+g)?. Now, any 
positive function f € L” is the limit in mean of positive functions f, € L(X) 
since, if a sequence f, € L(X) converges to f in L?, so must functions 
Re(fn)*. Passing to the limit, we conclude that (1) holds for positive f,g € 
LP, 

We then consider an increasing sequence (f,,) of positive functions in L?. 
For i < j, (1) can be applied to the positive functions f; and f; — f;, whence 


Np (fy — fa)? S Np (f5)” — No fi)” - 


If sup Np(fn) < +00, the increasing sequence of numbers N,(f,,)? converges 
to a finite limit, and so Cauchy’s criterion holds in L? for the sequence (f;,), 
qed. As mentioned in the case p = 1, one could also apply theorem 6 to the 
series Gere -- nee 


Lemma 2. Any decreasing sequence (fn) of positive functions in L? con- 
verge in mean. 


Apply theorem 8 to functions f; — fn. 


Lemma 3. The lower envelope of a countable family (f,) of positive func- 
tions in L? is in L?. 


§ 2. L? Spaces 29 


Apply the previous result to the functions inf(f1,..., fn); we already know 
(lemma 1) they are in L?. 


Lemma 4. The upper envelope of a countable family of positive functions 
fn € LP is in L” if and only if there is function F > 0 such that!+ 


N,(F)<+00 &  fn(x) < F(x) ae. for all n. 


The condition is necessary: for F' take the upper envelope of the functions 
fn. If it holds, the functions g, = sup(f1,..-, fn) € L? form an increasing 
sequence and satisfy g, < F ae., whence Np(gn) < Np(F) < +00. Apply 
theorem 8 to functions g, completes the proof. 

For real-valued functions f,, it is better to assume that |f,(x)| < 
F(a) ae.. The upper and lower envelopes of the f,, are then clearly in L?. In 
this case: 


Theorem 9 (dominated convergence!”). Let (f,,) be a sequence of func- 
tions inL? converging ae. to a function f. Suppose that there is a function 
F > 0 such that 


N,(F)<+00 & = |fn(a)| < F(x) ae. for alln. 
Then f is in L? and 


lim N,(f — fn) =0, limp(fn) = "(f) if p=. 


It suffices to show that (f,,) is a Cauchy sequence with respect to conver- 
gence in mean. 
As |f; — f;| < 2F ae., lemma 4 shows that the functions 


are in L?, and lemma 2 that this decreasing sequence of positive functions 
converges in mean in L?. But since the given sequence converges outside a 
null set N, the usual Cauchy criterion tells us that lim g, (x) = 0 for « ¢ N.18 


11 The relation fn (x) < f(x) almost everywhere for all n means that, for all n, there 
is null set Nn, possibly dependent on n, such that f,(a) < f(x) for « ¢ Nn, hence 
fora ¢é N=UNn. 

!2 Throughout this theory, a function f or a sequence of functions fp is said to be 
dominated by a function g if |fn(x)| < |g(x)| ae. for all n. As it is constantly 
used, theorem 8 is often called Lebesgue’s theorem. 

13 For a sequence of complex numbers, the relation 


ijr>n=> |ui-uj| <r 
is equivalent to 
Un = sup |uj —uj| <r, 
i,j2n 


so that Cauchy’s criterion means that limvn = 0. 
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Theorem 7 then shows that lim N,(gn) = 0. However, | fi(z) — f;(x)| < gn 
and so 


NoWe wy) = Nola) Sek 19 2 ws 


This gives Cauchy’s criteria and convergence in L?, qed. 
Compare with theorem 19 of Chap. V, n° 23. 


Corollary. Let p and q be two real numbers such that 1 < p, q< +co. For 
all functions f € LP? L4, there exists a sequence of functions f, € L(X) 
converging to f almost everywhere, as well as in L? and in L%. 


Indeed, it is possible to extract from L(X) two sequences (g,) and (h,) 
that are convergent to f ae. and such that f is the limit in L? (resp. £4) of 
the sequence (gn) (resp. (hn )). They can even be assumed to be dominated 
by two functions G € L? and H € L? (corollary 3 of Riesz-Fischer). The 
function w,(%) = gn(£)/|gn(x)| is defined and continuous on the open set 
Un = {gn 4 0}. Let us define f, by 


inf(|gn(x)|;|hn(2)|) wn(a) if gn(x) #0, 


fn(x) = 
0 if gn(x) =0. 
The function f,, is continuous on U,, and as |fn(x)| < |gn(x)| everywhere, 
it is also continuous at every « € X — U,. As a result, fn € L(X). fn(x) 
obviously converges ae. to f(x), and as f, is dominated by G and by H, 
convergence in L? and in L% follow, qed. 


(ii) Relation between L? and L'; Hélder’s inequality. Lebesgue’s theo- 
rem is going to enable us to explain the expression “p-th power integrable 
functions”: 


Theorem 10. A function f is in L” if and only if the function |f|?~1f is 
in L'. Then so is the function |f|? and 


(5.2) Np(f) = wf?) . 


We obviously assume that p > 1 and confine ourselves to finite-valued 
functions as others reduce to them thanks to null sets. 

Let us suppose that f € L, choose (corollary 1 of theorem 7) a series of 
functions f, € L(X) such that 


SN ho < +00, f(x) = Ae ae. 
and set 


$n(t) = file) +... + fale), S(t) = So |fn(2)| - 
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The functions gn = |Sn|?~'s, are still in L(X) since p > 1. On the other 
hand, |gn| = |sn|? < S? = G with a function G having values in [0, +00] 
satisfying 


Ni(G) = Na(S?) = Np (SoIfel)” < (SON) < +00. 


The dominated convergence theorem can therefore be applied in L* to the 
sequence (g,), and as it converges ae. to the function g = |f|?-'f, it is 
integrable, and hence so is its absolute value |f|?. The same argument applies 
to functions |g,,|, which converge ae. to |f|?, whence 


(|F |?) = lim p (gn) = lim p ($n |?) = lim Np (Sn)? 


since the s, € L(X) are integrable. Since the series }> f, converges to f in 
L?, it finally follows that 


e(IFIP) = Noth)? ; 


whence (2). 

Conversely, suppose that g = |f|?~!f € L”. The theorem being trivial for 
p = 1, one may suppose that p > 1. The map z + |z|?~+z from C to C is 
a homeomorphism. Its inverse is z +> |z|~!/4z, with 1/p + 1/q = 1 and so 
0 <1/q <1. To ensure continuity at 0, one needs to assume |0|~1/%.0 = 0. 
As a result, 


(5.3) f(w) =|g(a)|-"/" g(a). 

Let gn € L(X) be a sequence of functions such that 

(5.4) S 7 Ni(Gn) << +00, g(t) = >° gn(a) ae. 
and we set 


fn =l91+--- + onl!" (91 +... +9n) - 


The map z+ |z|~!/4z from C to C being continuous and zero at z = 0, like 
gx for all k, f, is in L(X) for all n. Then 


/ 
fn] = |g +--+ gnl'/? < (Slo!) "ah. 


The first relation (4) implies that Ni (33 |gz|) < +oo, and as a result, 
N,(h)? = Ny (Al?) = Ni OS gel) < +00. Therefore, the dominated con- 
vergence theorem in L? applies to the sequence (f,,). However, (4) shows 
that it converges ae. to |g(x)|~!/%g(x) = f(x), whence f € L”, qed. 
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Theorem 11 (Hé6lder). Let p,q > 1 andr > 1 be real numbers such that 
1/p+1/q=1/r. Then 


(5.5) fel” & gelt=fgeLl 
and 
(5.6) Nr(f9) < Np(f)Na(g) - 


Relation (6) is a particular case of (3.4’), which holds without any in- 
tegrability assumptions. It, therefore, suffices to prove (5). To this end, we 
choose sequences (f,,) and (g,) in L(X) such that 


lim Np(f — fn) =O & limN,(g- gn) =0. 


The proof reduces to showing that lim N,(fg — fngn) = 0. Now, (fn) may 
be assumed to converge almost everywhere to f and to be dominated by 
a function F > 0 such that N,(F') < +00; even (g,) may be assumed to 
converge ae. to g and to be dominated by some G > 0 such that Ng(G) < +00. 
Then the product fngn, being dominated by the function FG whose norm N, 
is finite by (6), converges ae. to fg. Applying Lebesgue’s theorem completes 
the proof. 


Corollary 1. If f,g € L?, then fg is integrable and 


(5.7) lu(f9)| < No(f)No(g)- 


This corollary shows that a Hilbert inner product 
(5.8) (fl) = nen) = f Fea aula) 


can be defined on L?. It is obviously linear! in f, satisfies the condition 
(flg) = (gl f), and finally, by theorem 9, 


(ff) =u (If?) = No(f)? 20, 


which reduces (7) to the Cauchy-Schwarz inequality. This shows that (f|f) = 
0 implies f = 0 ae., or f = 0 if one argues, as one should, in terms of classes 
mod N. As, moreover, L? is complete, we conclude that L? is a genuine 
Hilbert space. This result explains its importance in applications since the 
theory of Hilbert spaces is far simpler and more complete than that of other 
classes of topological vector spaces. We will again come across the L? spaces 
in relation to Fourier transforms, but they have many other applications. 


14 One can formulate the same definition on F?, but the result would not be an 
inner product: the function p* is not linear on F?. 


§ 2. L? Spaces 33 


Relation (6) is mainly of interest for 1/p+1/q = 1, from which r = 1 and 
the same conclusion as above follow: for f € L? and g € L’, the function fg 
is integrable and 


(5.9) M(f9)| < Ni(fg) S Np(f)Na(y) - 


As a result, for given g € L?, the map 


ft wf) 


is a continuous linear functional on L?. It will be shown later (n° 18, theo- 
rem 30) that it is the only one. As q must be finite, this supposes that p > 1. 
For p = 1, a space L®, which we shall also define later, is needed. 


Corollary 2. If f € L”, then fx, € L* for every compact set K Cc X and 
fg€L' for allg € L(X). 


Indeed, the function y, is in L+ (n° 4, lemma 7) and hence is in L4 
for all q (theorem 10), whence fx, € L'. Use the same argument for the 
product fg. 

In fact, both propositions of corollary 2 are equivalent: 


Lemma 5. The following two properties are equivalent for all functions j: 

(LI 1) the function j(x)x,.(x) ts integrable for all compact sets K ; 

(LI 2) the function j(x) f(x) is integrable for all f € L(X). 

The map f ++ u(fj) is then a complex measure on X. 

(LI 1) => (LI 2): if f vanishes outside a compact set K, then fj = f.jy,- 
Hence, corollary 2 applied to fy, gives the result. 

(LI 2) => (LI 1): if fj € L* for all f € L(X), f can be chosen to be equal 
to 1 on Kk, so that, by corollary 2, jx, = fj.v, is integrable. 

The map f + wu(fj) is obviously a linear functional on L(X). If f € 
L(X,K), where K is a compact set, then 


|f(x)§(@)| = |F(@) Xe (@)5H(2)| < NFM (@) Xx (Z)| 
and so 


(FH) S [5X lh Fl, 


whence the continuity of f > p(fj) on L(X, K), qed. 
A function j is said to be locally integrable with respect to yu if it satisfies 
the equivalent conditions of lemma 5. We then set!> 


(5.10) : i(e)du(e) = i §(2) xq (a2) dqu(e) 


© One can define the integral over any measurable set A such that j(x)ya(z) is 
integrable in a similar way, for example if A is contained in a compact set. 
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for every compact set K C X, and we say that fH u(fj) = v(f) is the mea- 
sure with density j (for “Jacobian”) with respect to yu. This is symbolically 
written as 


du(x) = j(x)dp(z). 


Hence, by definition, 


[to ieodute) = f re \i@)-dy(e) 


for all f € L(X). A punctuation mark has been inserted in every integral so 
as to separate the function being integrated from the measure with respect 
to which it is integrated. We will see in n° 16 that the previous relation 
generalizes to integrable functions f with respect to the measure jdu, but 
this result, “obvious from a physics viewpoint” is far from being so from a 
mathematical one... 

For example, d*z = dx/|x| is a measure on R* (but not on R). 


(iii) Applications to Fourier transforms on R. The definition of L? spaces 
adopted above can be directly applied. Let us show for example that there 
exists a unique isomorphism from L?(R) onto L?(R) which, on the Schwartz 
space S 16 | reduces to the Fourier transform. Indeed, we know that the map 
ff from S to S is bijective and preserves the L? norm (Chap. VII, n° 31, 
theorem 28). On the other hand, S is everywhere dense in L?(R) because any 
function f € L(R) can be approximated using C® functions zero outside a 
fixed compact set (Chap. V, n° 27, theorem 26), which implies convergence in 
all L? spaces. Thus the Fourier transform extends to L?(R) and the extension 
is unique. The same would be the case if R were replaced with T: for every 
f € L7(T), there is a Fourier series > f(n)u” for which 


S| Fe)) = 118. 


and conversely. The convergence of the series is not an obvious consequence. 
However, it is possible to show that the symmetric partial sums of the series 
>> f(n)en(u) converge almost everywhere to f(u), a difficult result to prove 
~and false for L1(T). 

One can also define the Fourier transform in L1(R): since L(R) is every- 
where dense in L'(R) and since 


fll < Wl 


holds trivially for all f € L(R), the left hand side being as usual the uniform 
norm on R, the map f +> f can be extended by continuity to a map from 


16 Recall that, it is the space of C’° functions y on R for which all functions 
x? yp) (x) are bounded. 
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L*(R) to the space of continuous functions on R, which is obviously given by 
the formula 


(5.11) Fl) = f f(ejeteajar ae 


where, like in Chap. VII, we set e(x) for exp(27ix). As we know that f 
(Chap. VII, n° 27, theorem 23) tends to 0 at infinity for f € L(R), the 
Fourier transform maps L1(R) to the space L.,(R) of continuous functions 
tending to 0 at infinity. 

It should be noted that for f € L'(R)M L?(R), we thus have at our 
disposal two possible definitions of the Fourier transform. Let us show that 
they are compatible. Indeed, we know (corollary of theorem 9) that there is a 
sequence of functions f, € L(IR) converging to f both in L1 and in L?. Then 
Chel given by (11), converges in Z..(R) (uniform convergence) to function 
(11) for f, and in L?(R) to the Fourier transform of f in the L? sense. Hence 
by the Riesz-Fischer theorem for L?, it is the (class of) function (11). 

This result enables us to calculate if using a formula analogous to (11) 
whenf € L?(R). Indeed, let f,, denote the function equal to f for |z| <n 
and 0 elsewhere. By corollary 2 above, it is in L'. It is also in L? since, being 
the product of |f|f € Z+ and the characteristic function of [—n,n], |fn|fn is 
in L. Its Fourier transform can therefore be calculated using (11). On the 
other hand, f,, clearly converges (dominated convergence) to f in L?. As a 
result, 


or using notation (10), 


+n 

(5.12) fly) =lim? f(a)e(xy)de. 
If, for example f is a regulated function approaching 0 monotonously at in- 
finity, the (ordinary) limit of the integral exists for all y 4 0 (Chap. V, n° 24, 
theorem 23). So, if we know beforehand that f € L?, in other words (exer- 
cise!) that the Riemann integral [ | f(x)|?dx converges, we can replace the 
symbol L.i.m.” with an ordinary limit for y 4 0, which by the way shows that 
it is necessary to include functions that are only defined almost everywhere. 

We will see in n° 10, (iv) how to generalize the Fourier inversion formula 
(Chap. VII, n° 30, theorem 26) to functions f € L'(R), namely that, if 
f € L\(R) as well, then 


(5.13) fle) = | Flwetenddy. 
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Finally note that all this falls in the framework of Fourier transforms of 
tempered distributions [Chap. VII, n° 32, formula (32.11)]. It should first be 
observed that every f € L?(IR) defines a distribution Ty given by 


Ty(y)= f ele)fle)da for pes. 


The integral is well-defined since for all g, y € L%, and Holder’s inequality 
readily shows (exercise!) that Ty is “tempered”. By definition, the Fourier 
transform of T’'y is the distribution 


Ty) =Ts(G) = / Bly) Fly)dy. 


For f € L?, this integral is the inner product of @ and f, and so is also 
equal to the inner product of @ and the Fourier transform of f, which is the 


n~ 


conjugate of f(—a). As G(a) = y(—2) for ye S, 


Top) = if ole) Fle)de 


readily follows. As a result, the Fourier transform of the distribution defined 
by f is the distribution defined by f. 

If it is possible to identify a function f € L? with the distribution T;, then 
the latter can be assigned a Fourier transform, but it is not always defined 
by a function. 
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§ 3. Measurable Sets and Functions 


The definition of integrable functions as limits in mean of continuous func- 
tions with compact support cannot always be used. This § will show that a 
function f is in L? if and only if it is not too complicated and, of course, that 
it satisfies N,(f) < +00. 


6 — Measurable and Integrable Sets 


(i) Properties of integrable sets. In line with the general principle stated in 
n° 2,a set A C X is integrable if so is its characteristic function x4; the 
measure of A is then the number 


(6.1) WA) = w(va) = w(A). 


The following properties are immediate restatements of results from the pre- 
vious n°. An open set U is integrable if and only if u*(U) < +00, and then 
u(U) = p*(U) (n° 4, lemma 3). The intersection of a finite or countable 
family of integrable sets is integrable (n° 5, lemma 3). In addition, for a 
decreasing sequence of integrable sets (A,,), 


(6.2) 1 (Msn )= inf w(A,) = lim p(A,)- 


The union of a finite family of integrable sets is integrable (n° 5, lemma 1). 
The same is true for the union of a countable family (A,,) provided its outer 
measure is finite (n° 5, lemma 4) or, equivalently, provided there is a set of 
finite outer measure containing all the sets A, ; then 


(6.3) 1 (LAn) < 0 (An 


since the characteristic function of A is bounded above by the sum of those of 
the sets A,,. Furthermore, the relation 5> u(An) < +00 suffices to ensure the 
integrability of LU) An, and if the sets A, are pairwise disjoint, inequality (3) 
becomes an equality (theorem 6). If, moreover, the sequence A,, is increasing, 
then 


(6.4) lu (U An) = sup #(An) = lim (An) 


(theorem 8 of n° 5). 
Finally, it is clear that if A and B are integrable and if A Cc B, then B—A 
is integrable and 


(6.5) p(B — A) = w(B) — w(A). 
Any compact set K is integrable since its characteristic function is usc and 
it upper integral is finite. 
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Let us, for example, show that the Cantor set C has measure zero. This 
follows by removing from K = [0,1] its median interval ]1/3,2/3[, then by 
repeating this process for the two remaining intervals, then for each of the 
four remaining ones, and so on. The sum of the measures of the strictly open 
intervals equals 


1/3 + 2/3? + 27/33 +...=1, 


and as they are pairwise disjoint, u(A — C) = 1, and so p(C) = 0. 


Lemma 1. A set AC X is integrable if and only if, for all r > 0, there is 
an open set U and a compact set K such that 


KCACU & wWU-K)<r. 


Every integrable set is the union of a null set and a countable family of 
compact sets. 


By lemma 4 and n° 4, the condition is sufficient since the characteristic 
functions of K and U are respectively usc and Isc. To show that it is necessary, 
we choose integrable functions y and w respectively lsc and usc such that 


(6.6) VSxaSy and wly—y)<e, 


where € > 0 is given. They can be assumed to be positive, if need replacing 
them with yt and yt. 

For any integer n > 1, the relation y(x) > 1 — 1/n defines an open set 
U D> A, and (1—1/n)u(U) < ply) < w(A) +. So WU) < p(A) +r if € is 
sufficiently small and n sufficiently large. 

On the other hand, the function 7 being usc and positive, and so bounded 
above by a continuous, positive function with compact support, the relation 
(a) > 1/2 defines a compact set K C A. As y(x) < 1/2 on A— K and 
W(a) < lon K, d(x) < xa-K(@)/2+x,(x) for all x € A, and in fact for all 
x €X. Asa result, u(y) < u(A— K)/24+ u(r). But (6) shows that 


(xa — 2) = MA) — wh) < w(y—y) Se. 


Hence, it follows that 
yA) —€ < pl) < (A — K)/2+ p(K). 


So w(A— Kk) <e+p(A— K)/2 and finally (A — K) < 2e. 

At last, for compact sets K, C A and open sets U, D A such that 
wWUn — Kn) < 1/n, the set N = A—\|J Kp, contained in U, — K,, for all n, 
has measure zero, qed. 

A consequence of the previous lemma is that any reasonable set E C X is 
contained in the union of a null set and a countable family of compact sets. 
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The converse is obvious. Indeed, by definition, E C UJ En, where pi* (En) < 
+oo for all n. By (2.3’), each EF, is contained in an open set U, such that 
u*(U,) < +00, and so is integrable. As a result, U, = Ny, UU Knp, where 
the sets Kn» are compact and N,, null sets. F is, therefore, contained in the 
union of a null set N =) N,, and a family of sets Ky,, ged. 

The entire space X may be integrable, in particular if X is compact. 
The measure y is then said to be bounded. The whole of X is then clearly 
reasonable, and hence so are all subsets of X. As, on the other hand, |f| < 
llfllv, for any function f € D(X), it follows that 


IMA) < M||fll where M = p(X). 


Conversely, every measure satisfying an inequality of this type is bounded 
since p*(X), a supremum of pu(f) for f € D(X) such that 0 < f < 1, is then 
< M. Hence if we equip L(X) with the norm ||f|| = ||f||x of uniform con- 
vergence on X, bounded measures are just the continuous linear functionals 
on L(X) that are positive for f > 0. If for example j is a positive integrable 
function, then the function fj is integrable for all f € L(R) (corollary 2 of 
theorem 11) and the map 


front = f foile)dula) 
is a bounded measure since | fj] < || f||j and so 


lH(F5)| < M(HIFI- 


We will show later [n° 17, (ii)] that this property characterizes integrable 
functions. 


(ii) Measurable sets. The notion of a measurable set is at least as useful 
as that of an integrable one. That is the name given to any set A C X such 
that AM K is integrable for all compact sets K C X . This is the case of all 
integrable, closed or open sets. Properties of integrable sets regarding unions 
and intersections readily show that 
i) any finite or countable intersection of measurable sets is measurable, 
ii) the complement of a measurable set is measurable, and hence 

(iii) any finite or countable union of measurable sets is measurable. 

In addition: 

(iv) a set A is integrable if and only it is measurable and of finite outer 
measure. 

Indeed, if A is integrable, then so is ANK for any compact set kK. Thus A is 
measurable. Conversely, suppose that A is measurable and that ~*(A) < +oo. 
There is a lsc function y > x, such that u*(y) < +oo. It is integrable (n° 4, 
lemma 3), and so vanishes outside a set of the form N UU Kn, where N is 
null and the K, are compact (n° 4, corollary 2 of theorem 7). The same is 
true for y4. Thus 


—_~— 
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A=(AnN)ULJ(4Nk,), 


which is the countable union of integrable sets since A is measurable. As 
p*(A) < +00, this union is also integrable, qed. 

One could add that 

(v) a measurable set A is reasonable if and only if it is the union of a null 
set and a countable family of compact sets. 

We saw above that, if A is reasonable, then it is contained in a set of the 
form N UU Ky, where N is null and the K,, are compact. AN N is null and 
Af K,, integrable, and so of the form N,, UU Knp, ged. 

Properties (i) and (ii) are sometimes expressed as saying that the set of 
measurable subsets of X is a tribe or a o-algebra, a barbarous expression!” 
used by abstract measure specialists. These results enable us to construct 
more and more complicated measurable sets from open or closed sets: count- 
able intersections of countable unions of countable intersections of...As no 
one has ever constructed a non- measure set without resorting to the axiom 
of choice in some form or other, it may be thought that none are encountered 
in classical mathematics. In fact, as the Lebesgue-Fubini theorem will show, 
there are fields where caution is warranted. 

A tribe is a subset of the set P(X) of subsets of X (Chap. I, n° 4), 
and it readily follows from the definition that any intersection of tribes is 
again a tribe. Hence, one may talk of a smallest tribe containing the open 
sets. By definition, its elements are the Borel sets. They are measurable 
with respect to all measures w on X, which is why they are of interest. The 
next simplest ones are the countable intersections of open sets (the G's sets in 
usual notation, a category containing the closed subsets if X is metrizable!), 
then the countable unions of G5 sets (the G5, sets), then the countable 
intersections of G's, sets (the G5,5 sets), etc. From these classes of sets, in 
general pairwise distinct and each larger than the preceding one, the reader 
may get the impression that all Borel sets can be obtained by this rather 
simple mechanical process. False. Choosing in each of these classes a set not 
belonging to any of the previous ones gives the union of a countable family 
of Borel sets that does not in general belong to any of these classes. The 
construction would need to be continued beyond countability. 


17 Tn the special issue of Pour la Science dedicated to N. Bourbaki, it is said that 
the terms “clan, phratry, tribe” used by this author before he decided to reject 
abstract measures show his lack of taste for the topic. It was more a matter 
of wishing to express himself in French. The English translation of N. Bour- 
baki’s book came back to o-algebras, probably because of what the English and 
Americans themselves call the NIH syndrome (not invented here)... 

18 If X is a metric space, any closed subset F is a Gs set since it is the intersection 
of the open sets defined by the inequalities d(x, F) < 1/n. I take it that the 
letters 6 and G comes from the German for intersection, Durchschnitt and open 
geffnet. 
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The notion of a tribe explains the theory of abstract measures.!? Take a 
set E without topology, a tribe of subsets of EF and a function py on it with 
values in [0,+00] having the same formal properties (countable additivity) 
as a Radon measure. This leads to the definition of integrable functions, L? 
spaces and to Lebesgue’s theorems. For example, if there is a Radon measure 
on a locally compact space X and if FE is taken to be a measurable subset 
of X, the measurable sets contained in E form a tribe on which the function 
(A) = A(A) is a measure on F in the previous sense. If EF is locally compact 
(i.e. the intersection of an open and a closed set), js is the Radon measure on 
E which can be directly defined by putting 


uf) = AF) 


where, for all f € L(E), f’ denotes the function equal to f on E and 0 
elsewhere. As a compact subset of E is also a compact subset of X, f’ € L(X). 
vis said to be the measure induced by A on E. For example, this enables us to 
talk of the Lebesgue measure on a locally compact subset of a space R”. If 
is defined by the previous formula, we immediately encounter the problem of 
having to prove the following result, “obvious from a physics viewpoint”: a 
function f on E is integrable with respect to yu if and only if so is its extension 
f’ with respect to A, in which case the previous formula continues to hold. 
This in fact require a non-trivial proof.?° 


7 — Measurable Functions 


(i) Separable spaces. Let us consider a locally compact space X, a positive 
measure 4 on X and a sequence of maps f, from X to a metric space P. 
Suppose that the sequence (f,) converges everywhere to a limit function 
f(x). For a given a € P and a given number R > 0, let A Cc X be the set of 
x where 


d[f(x),a] < R. 


We intend to compute it using similar sets with respect to the functions fp. 
The relation « € A means that there are integers m and & (dependent on 
x) such that 


(7.1) d|fn(z),a]< R-1/m foralln>k. 
We set 
Amn = {d[fn(x),a] << R-1/m}. 
For given m and k, (1) clearly means that 
rE () Amn - 
n>k 


19 See for example Chap. 1 in Walter Rudin’s Real and Complex Analysis. 
20 See N. Bourbaki, Integration, Chap. V. 
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Hence 


(7.2) Aa Nh (as 


m,k>O0n>k 


If, instead of converging everywhere, the sequence f,, converged almost every- 
where to f, the result would obviously remain the same up to a null set. This 
shows that if the inverse images of the open balls of P under the functions 
fn ave measurable, then f also has the same property. 

This argument, however, relies on using a metric d defining the topology 
on P. To obtain a result involving only the topology on P, we would need 
to show that it is preserved if d is replaced by another metric compatible 
with the topology on P. Now, any open d’-ball (and more generally any open 
subset of P) is then a union of open d-balls. Hence there is no problem if 
every open d’-ball is known to be the countable union of open d-balls and 
conversely. It is simplest to suppose that all open subsets U of P is the 
countable union of open balls, a property independent of the chosen metric 
since a family (Bmn) dependent on two integer indices is still countable. 

This is for example the case if P = R”: every open subset of P are the 
unions of open balls of rational radii and centred at points with rational 
coordinates. Let us suppose that in the general case there is a countable and 
everywhere dense set D in the metric space P. P is then said to be separable. 
If U is open in P and if a € U, U contains an open ball B(a,r) with r € Q, and 
the ball B(a,r/2) contains some d € D. The ball B(d,r/2) is then contained 
in U and contains a. Since a € U is arbitrary, U is the union of open balls 
centred at points of D and of rational radii, whence the result. 

The existence of D is equivalent to the next property: 


(TD) There is a countable family of open set (U;,) such that any open 
subset of P is the union of those U,, contained in it. 


We saw above that (TD) is clearly necessary. Conversely, if (TD) is satisfied 
and if, for all n, some a, € U,, is chosen arbitrarily, the set D of the points 
Gyn, meets every open subset of P, and so is everywhere dense. For metrizable 
spaces, (TD) is thus equivalent to separability. 

A separated topological space (Hausdorff axiom) satisfying (TD) is said 
to be of countable type; any family (U;,) satisfying (TD) is called a basis for 
the topology of X. We will return in much greater detail to all this in n° 11. 


(ii) Measurable maps. After these preliminaries, let us return to X and 
to the measure p. A map f from X to a metrizable and separable space P is 
said to be measurable?! when it satisfies the condition 


(FM): the inverse image under f of every open subset of P is measurable. 


21 A definition applicable to all topological spaces will be given later. 
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Every continuous map is obviously measurable. The arguments that have led 
to relation (2) then prove the next result: 


Theorem 12. If a sequence of measurable functions with values in a metriz- 
able and separable space converges almost everywhere, then the limit function 
is measurable. 


An immediate corollary: 
Theorem 13. All functions belonging to a L” space are measurable. 


Because it is the limit almost everywhere of a sequence of continuous 
functions with values in C, the archetype of a metrizable and separable space. 

For a function f with values in R or, more generally, in [—oo, +00], mea- 
surability means that f~'(I) is measurable for every interval J, including 
for example if I = {+00}, or {—oo}, or [8,+00[, etc.. It would even suf- 
fice to require this for intervals of the form J = [a,+o0o] where a is finite 
since every other interval can be obtained from this type of intervals using 
a countably infinite number of standard operations. For example, [—o0o, al is 
the complement of [a, -+oo], and [0, a[ = [0, +00] N [—0«, al. So 


[0, 1] = [0, +00] Nf) [00,1 + 1/nf. 


Theorem 13 continues to hold for these functions since they are anyhow finite 
almost everywhere. We will see later that, modulo the condition N,(f) < 
+oo, this theorem characterizes the functions of L?. 
A measurable function remains so if it is modified on a null set. Hence 
the definition can be applied to functions defined almost everywhere. 
Property (FM) can be generalized: 


Theorem 14. Let P be a metrizable and separable set. The inverse image of 
every Borel set B C P under a measurable map f : X —>+ P is measurable. 


Let T be the set of subsets E of P such that f~'(E) is measurable. Ele- 
mentary formulae about the inverse image of an intersection or a complement 
show that, like the family of measurable subsets of X, 7 is a tribe. However, 
by definition, TJ contains all the open subsets of P, and hence also all its 
Borel subsets. 


Lemma 1. Let fi,..., fn be maps from X to metrizable and separable sets 
P,,..., Py. Then the map 


friars (fila),---,fn(x)) 


from X to the product space P = Py x ... x Py is measurable if and only if 
so are the maps f;. 


First, the product space is metrizable and separable since if, for each 2, 
D,, is everywhere dense in P;, then the product of the sets D; is everywhere 
dense in P. If f is measurable, then, for any Borel set B C P;, the set 
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FOB Pox PHF CB) 


is measurable. Hence the condition is necessary. 
To show that it is sufficient, note that, if U; is an open subset of P;, the 
product of the sets U; is open in P,. Since 


fia (hie eee a0 — Of), 


the left hand side is measurable. Hence so is f~'(U) for any open set U C P 
if U is shown to be the countable union of a product of open sets. To see this, 
we equip P with the distance 


d(a, b) = sup d; (a;, b;) , 


where d; is the distance on P;. It defines the topology of the product. The 
open balls of the product are then products of open balls in the sets P;, which 
are unions of open balls of rational radii centred at points of countable and 
everywhere dense sets D; C P;, qed. 

In the next statement, if P and Q are separable metric spaces, y : P —> Q 
will be said to be a Borel map if, for any open ball V C Q, y~!(V) is a Borel 
subset of P. Then, more generally, the same holds for all Borel sets V since 
the set of subsets V for which f~'(V) is a Borel set is a tribe. A Borel map 
from X to a separable and metrizable space is, therefore, measurable. The 
argument used to prove theorem 12 shows that the limit f of a sequence of 
Borel maps fn converging EVERYWHERE is a Borel map,” since the sets 
Am,n Of relation (2) and so A are then Borel sets. For example, any regulated 
function on R is a Borel map. 


Lemma 2. Let P and Q be two separable and metrizable spaces, y a Borel 
map from P to Q and f a measurable map from X to P. Theng=gof: 
X — Q is measurable. 


It suffices to write that g~'(B) = f~1(B’), where B’ = y~!(B). 


Theorem 15. Let P,...,P, and Q be separable and metrizable spaces, 
a Borel map from P, x... x Py, to Q and f; : X — P; measurable maps. 
Then the map 


rt— lfi(z),---, fn(z)] 
is measurable. 


This is a direct consequence of the previous two lemma. 
Theorems 12 and 15 enable us to check that all usual operations (quasi- 
algebraic formulae, passages to the limit) that can be performed on measurable 


22 Were convergence ae. sufficient, then (among others) any integrable function 
would be a Borel map, any null set would be a Borel set, and hence so would be 
any subset of such a set, etc. 
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functions lead to measurable functions. In fact, far more ingenuity would be 
needed to invent, using explicit formulae, non-measurable maps, possibly even 
non-Borel maps. However, see end of n° 11: the image of a measurable (resp. 
Borel) set under a continuous map may not be a measurable (resp. Borel) 
map, including in the case of the projection (x,y) 4 x from R? onto R. 

The set of measurable functions with values in C (or in a separable Banach 
space) is a vector space since (u,v) ++ au+ Bu from C x C to C is a Borel 
map. Together with theorem 12, this result shows that if a series > f,(x) of 
measurable functions converges ae., its sum, i.e. the limit of its partial sums, 
is measurable. The same holds for functions f,, with values in [—oo, +00], 
provided the expression (—oo) + (+00) is given a meaning. This can be done 
(in integration theory...) by assigning once and for all an arbitrary value to 
it since, if (fn) is a countable family with values in [—oo, +00], the set of x 
where —oo and +oc0 appear in the sequence (f,,(a)) is measurable as it is the 
intersection of the measurable sets 


fr’ ({-0o}) 9 fa * ({+00}) - 


The product of two complex-valued measurable functions is measurable. 
This result holds for any continuous bilinear map. For example, if f and g 
take values in a separable Hilbert space, the function « +> (f(x)|g(x)) is 
measurable. If f with values in a Banach space F is measurable, so is the 
function x + || f(x)|| since u + ||u|| is a continuous map from F to R. If f and 
g, with values in a separable and metrizable space P, are measurable, so is the 
function «+> d[f(x), g(x)] with respect to any distance compatible with the 
topology of P, since the map (u,v) +> d(u,v) from P x P to R is continuous. 
If f and g, with values in [—oo,+co], are measurable, so are sup(f,g) and 
inf(f,g). Thanks to theorem 12, we can deduce that the upper and lower 
envelopes of a countable family of real measure functions are measurable. 
Etc. 

Exercise. Define the measurable functions with values in the Riemann 
sphere C [Chap. VIII, n° 5, (vi)] and show that if f is measurable, then so is 
1/f. Define the map x +> 1/2 from [—co, +00] to itself by agreeing, either that 
1/0 = +00, or —oo, or any other value. Show that if f : X —> [—o0, +00] is 
measurable, then so is 1/f in all three cases. 


8 — Measurability and Continuity 


(i) Egorov’s and Lusin’s theorems. We saw above that any limit ae. of mea- 
surable functions is measurable. There is actually a more precise result whose 
proof, like the calculations of n° 7, (i), is a set theory exercise: 


Theorem 16 (Egorov). Let (fn) be a sequence of measurable functions 
with values in a separable metric space P and converging ae. to a function 
f :X — P. For alle > 0 and all integrable sets A C X, there exists an 
integrable set B C A such that u(A — B) < « and in which the sequence 
(fn(x)) converges uniformly to f(a). 
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Removing a null set from A, we may suppose that lim f,(x) = f(x) for 
all « € A without exception. Henceforth, all arguments apply exclusively to 
A. 


For integers m,p > 1, consider the measurable set 


Am(p) = {4 (f(x); fn(@)] < 1/m for all n > p} . 
Obviously 
(8.1) Am(p) C Am(p +1). 
By definition of simple convergence, 


(8.2) A=|JAm(p) for all m. 


p2i 


Convergence is uniform in a subset B of A if and only if, for all m, there 
exists p such that B C A,,(p), in other words, if and only if there exists a 
sequence (p,,) such that B C ()Am(Pm). Therefore, the proof reduces to 
showing that there exist integers p,, such that the set 


(8.3) B= (Am (Pm) 


satisfies u(B) > u(A)—«. As A— B= U[A-—Am(pm)], for this to be the 
case it suffices that 


(8.4) H(A) — 1 [Am (Pm)] < €/2” 


for all m. Since for given m, by (1), the sequence (A;(p)) is increasing, for 
each m, the existence of an integer p,,, satisfying (4) follows from (2), qed. 
Using Egorov’s theorem, measurability can be related to continuity. In- 
deed, the function f is the limit ae. of a sequence of functions continuous 
everywhere — for example if f belongs to a L? space -, and if the result is 
applied to a compact set kK C X, then f has the following property: 


(LUS) For all compact sets K C X and all € > 0, there exists a compact 
set Kk’ C K such that w(K — Kk’) < € and such that the restriction of f 
to K’ is continuous.?% 


More generally: 


Theorem 17 (Lusin). A function f with values in a metrizable and sepa- 
rable space P is measurable if and only if f satisfies condition (LUS). 


23 Tn the following, by abuse of language, we will often say that “f is continuous 
on A” meaning that the restriction of f to A is continuous. This does not 
mean that f is continuous at every point x € A as a function on X (example: 
the characteristic function of A is continuous on A in the previous sense, but 
discontinuous at every boundary point of A as a function on X). 
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We break up the proof of this result into several parts. 

(a) Let us first suppose that there is a countable partition of X into 
measurable sets E,, such that f is constant on each of them. We will then say 
that f is a step function. This is a less trivial analogue of the step functions of 
Chap. V, n° 1. All functions of this type are measurable. Indeed, if f(z) = un 
on E,, and if U is a subset of P, the set f~'(U) is the union of the sets Ep, 
for n such that u, € U. Hence it is measurable. 

(b) Let us now show that all step functions f have the stated property. 
Indeed, let K be a compact set. K is the disjoint union of integrable sets 
KO E,. Hence, for « > 0 and all n, there is a compact set K, C KM Ey 
such that w(K NE, — Kn) < &, = €/2”. Since f is constant on each Ey, it 
is continuous on each K,,, hence also on 


ye ie eae eo 


for all n since the K; are closed and pairwise disjoint. But 


w(K — Ki) = > w(K Ep— Ky) + >) w(K Ep) - 


psn pon 


The second sum is arbitrarily small for large n since 


Sou (KN Ey) = w(K) < +00 


pal 


and the first one is bounded above by €; +...+€, < €, qed. 

(c) If there is sequence (f,,) of measurable functions, then for all « > 0, 
there is a compact set K’ C K such that w(K —K’) < € and on which all func- 
tions f, are continuous. Indeed, for each n, there are sets K, C K on which 
fn is continuous and such that uke — K,) < €/2”. Then w(K — (| Kn) < € 
and the compact set K’ = ()K, answers the question. 

(d) Let f : X —>+ P be an arbitrary measurable map. To show that it 
has Lusin’s property, choose a distance d on P and a number r > 0. As P is 
separable, it is the union of a sequence of open balls A, of radius r. Replacing 
the A,, by the sets 


Be a ie AA Agta Be 


we get a partition of P into Borel sets of diameter < 2r. Since f is measurable, 
the sets E, = f~'(Bn) form a partition of X into measurable sets on which 
the function f is constant up to 2r. For each n, we choose some x, € EF, and 
we define a step map f from X to P by the condition g(x) = f(#,) on Ep. 
Then the uniform distance d(f,g) = sup d[ f(x), g(x)] is clearly < 2r. 

Giving values of type 1/n to r, we thus see that f is the uniform limit 
in X of a sequence of measurable step functions g, with values in P. Now, 
section (c) of the proof shows that there is a compact set K’ C K satisfying 
p(k — K") < € and on which the functions g,, are continuous. Hence so is f, 
and property (LUS) follows for f. 
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(e) It remains to prove that conversely, any function f having this property 
is measurable, i.e. that the set f~'(U) M K is integrable for any open set 
U Cc P and any compact set kK C X. By assumption, K is the union of a 
sequence of compact sets K,, on each of which f is continuous and a null set 
N. Therefore, f~'(U)M Ky, is open in K,,, and so is a Borel set, and thus is 
measurable. Hence, so is f~'(U)K, the union of thef~!(U)N Ky, and a null 
set. As f-'(U) 1 K has finite outer measure, it is integrable [n° 6, property 
(iv)], qed. 

The proof highlights the analogy between continuity and measurability: 
f is continuous (resp. measurable) if and only if there is a countable cover 
of X by open (resp. measurable) sets on which f is constant up to ¢. This 
obviously supposes that P is separable. 

Here is a useful application of Lusin’s theorem: 


Corollary. Let X be a locally compact space, 4 a positive measure on X 
and j a locally integrable function with respect to [. 


[ foite)aute) =0 forall f € L(X) 


if and only if j(x) = 0 almost everywhere on every compact set?* K Cc X 
[hence j(z) = 0 almost everywhere if X is countable at infinity or if py is 
reasonable]. 


The condition is clearly sufficient. To show that it is necessary, we argue by 
contradiction. As j is measurable, there is a compact set K and a measurable 
set A C K with non-zero measure such that j(x) 4 0 for all « € A. Lusin’s 
theorem enables us to assume that A is compact and that j continuous on A. 
The function j(x)x4(x) being integrable by assumption or definition, there is 
a sequence of functions f, € L(X) converging almost everywhere to jy. As 
jva vanishes outside the compact set A, the f, may be assumed to be zero 
outside a compact neighbourhood B of A. As jy,4 is bounded, the functions 
|fn| can also be assumed to be bounded above by a constant M independent 
of n (corollary 3 of theorem 7). Then the functions f,(«)j(x) converge almost 
everywhere to |j(x)|?x.4(x) and are dominated by the function Mj(x)x4(2), 
which are integrable by assumption. Hence 


i li()[? x4 (0)du() = lim i() fr(a)du(x) =0, 


and thus j(a) = 0 almost everywhere on A, a contradiction. 

It should be noted that the condition in the statement says that the 
measure j(x)du(a) with density j [n° 5, (ii)] is zero. 

Exercise 1. Let K be a compact set and f a function vanishing outside 
kK and Isc on Kk. Show that f is measurable. Deduce that for real functions 


4 This means that the set N = {j(X) 4 0} satisfies u(NMK) = 0 for any compact 
set. Such a set is said to be locally null. 
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with values < +00, “continuous” can be replaced by “Isc” in the (LUS) 
statement. 


(ii) Lusin-measurable functions. Apart from the importance of Lusin’s 
theorem for its own sake, it can be used to construct a theory of measurable 
functions with values in a general (separated) topological space: by definition, 
these are maps f having property (LUS) of Lusin’s theorem. This point of 
view, taken by N. Bourbaki, often enables us to free ourselves from separabil- 
ity assumptions. Though they hold in most applications, as we will see later 
in relation to the vague topology on measure spaces, in functional analysis, 
we also need to for example consider functions with values in non-separable 
Banach spaces and even in non-metrizable topological vector spaces. In the 
exercises below, (LUS) is taken as the definition of measurability. 

Exercise 2. A function equal almost everywhere to a measurable function 
is measurable. 

Exercise 3. If the restrictions of a function f to a sequence of measurable 
sets A, are continuous and if X = NUUAn, where p(N) = 0, then f is 
measurable. 

Exercise 4. Let f be a measurable map from X to a topological space P. 
Show that f~1(B) is measurable for every Borel set B C P (consider first 
the case of open sets). The converse may be false if X is not separable. 

Exercise 5. Let f; be finitely many maps from X to topological spaces P; 
and g a continuous map from the Cartesian product of the P; to a topological 
space Q. Show that the map 


vt glfilz),---, fn(x)] 


from X to Q is measurable. 

Exercise 6. Let us say that a family (f;) of measurable maps from X to a 
topological space P is equimeasurable if for all compact sets K and all € > 0, 
there exists a compact set K’ C K such that w(K — K’) < € and on which 
the maps f; are continuous. Show that this is always the case for a countable 
family of measurable functions. 

Exercise 7. Show that Egorov’s theorem continues to hold for functions 
with values in an arbitrary metrizable space (use exercises 4 and 6). 

Exercise 8. (a) Let A be a compact subset of a Banach space H. Show 
that the closed vector subspace of H generated by A is separable. (b) Let 
X be a locally compact space, 4s a measure on X and f a map from X to 
H. Suppose that f is measurable and reasonable. Show that there exists a 
separable closed subspace H’ of H such that f(a) € H’ almost everywhere. 

The importance of equimeasurable families is due to the next result, which 
will prove useful later: 


Lemma 1. Let (fi)ier be an equimeasurable and increasing filtering family of 
functions with values in [0, +00]. The function sup fi(x) = f(x) is measurable 
and 
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(8.5) MO” (Xe f) = sup H" (Xie fi) 
for all compact sets K. If f is reasonable, then 
(8.6) u"(f) = supp" (fi) - 


Let us first suppose that f and hence that the functions f; are bounded. 
If K is a compact set on which all the functions f; are continuous, then the 
function f is lsc on K, so that y,.f is measurable (exercise 1), and hence is 
continuous on a compact set A’ C K such that (i — K’) is arbitrarily small. 
Thus f satisfies condition (LUS), and so is measurable. As f is continuous on 
Kk’, the functions f; converge uniformly to f on K (Dini’s theorem), which 
proves (5) for K’ since 


MOK fi — xK' FY) Se (E’) fi - fllice - 


The case of an arbitrary compact set K is obtained by passing to the limit 
using an increasing sequence of compact sets K,, C kK whose union is K, up 
to a null set. If the set { f(x) #0}, which is measurable, is reasonable [use 
property (v) of n° 6], then (6) is proved in a similar manner. 

If f is not bounded, consider the family of functions inf(f;,n) = fin 
with i € J and n € N. Like the given family, it is equimeasurable. It also 
an increasing filtering family for fixed i and varying n, or for fixed n and 
varying 7, or for varying i and n. For fixed 7, its upper envelope is f;; for 
fixed n, it is f, = inf(f,n), a function which the first part of the proof tells 
us is measurable; hence so is f = sup f,. Therefore (associativity!), supposing 
first that f vanishes outside a compact set, 


w(f) = sup LM (fr) = supsup LK (fin) 


= sup sup #1” (fin) = sup n* (fi) ; 
this proves (5) in the general case, and (6) as above if f is reasonable. 


9 — Measurability and Integrability 


The fact that a measurable set of finite outer measure is integrable (n° 6) is 
merely a particular case of the next result: 


Theorem 18. A function f with values in?° C or [—o0, +00] is in LP if and 
only if it is measurable and N,(f) < +00. 


(a) As |f(x)| < +00 ae., we need only consider the case of finite-valued 
function. By Lusin, there is a sequence of compact sets K,, which may be 
assumed to be increasing, such that f is continuous on each K, and zero 


5 . . . “ys . 
5 or in a Banach space H, provided Lusin’s definition is used. 
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ae. outside the union of these K,, because N,(f) < +00. Let f,, denote the 
function equal to f on K, and 0 elsewhere; then f(x) = lim f,(x) ae. and 
lfn(z)| < |f(a)| ae. So the proof reduces (dominated convergence) to showing 
that f, € LD? for all n. Thus, it reduces to the case where f is zero outside a 
compact set K and is continuous on K, which we will now suppose. 

(b) For all n > 1, K may be covered by finitely many open subsets Uz,n 
of X such that f is constant on KM Uz», up to 1/n. As in point (d) in the 
proof of Lusin’s theorem, for given n, the Borel sets K 1 U;,,,, can be replaced 
by pairwise disjoint Borel sets Ay. Let uxn = f(@kn), where Ayn € Ag n is 
chosen arbitrarily, and consider a step function f, equal to uz. in Ax,» and 
0 outside K . Denoting the characteristic function of Ag, by Xx,n; 


fn(x) = S- Xkn(@)URn everywhere. 
K 


We already know that the characteristic function of a measurable set of finite 
outer measure is integrable; hence (n° 5, theorem 10) is in L? for all p, so are 
the functions f,. In Ag n, 


|f(@) — fn(@)| = |F(@) — Ukn| <1/n, 


and so 


|fn(@)| S [F(a)] + Xx (2): 


As everything vanishes outside the union K of the sets A;,,, one can conclude 
that the sequence f,, converges everywhere to f while remaining dominated 
by the function g = |f| + x,. However, its norm N, is finite like those of f 
and x,,. The dominated convergence theorem then shows that the limit f of 
the functions f,, is in L?, proving the theorem. 

Its usefulness is due to the already mentioned fact that in practice, and to 
a large extent also in theory, all functions that one encounters are measurable. 
It then suffices to check that N,(f) is finite, which in general only requires 
finding easy upper bounds. 


Corollary 1. The product of a function of L? and a bounded measurable 
function is in L?. 


Obvious. 


Corollary 2. Let f be a reasonable and measurable function. f € L? if and 
only if 


sup iE Ife)? dul) < +00. 


KCX 


52 XI — Integration and Fourier Transform 


The condition is obviously necessary. As f vanishes outside the union of 
a null set and an increasing sequence of compact sets K,,, by theorem 2, 


we” (FP) = Tim 2 (FP Xa) 5 


hence the result. It can be applied for any measure on R by taking K, = 
[—n, nl]. 


Corollary 3. Let f be a reasonable and measurable function with values 
in [0,00]. Then there is an increasing sequence (fn) of bounded positive 
integrable functions such that 


f(z) =limf,(x) for all xe Xx. 


Indeed, let A, be an increasing sequence of integrable sets whose union 
is the measurable set { f(x) 4 0}. We set 


inf[f(z),n] if a € A,, 
play =< MUO if re 


0. otherwise. 


fn is the product of the measurable function inf(f,n) and the characteristic 
function of A,, and so is measurable and even integrable since p*(|fnl) < 
nyi( A,) < +00. These functions answer the question. 
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4. Lebesgue-Fubini’s Wa 
gs y 


10 — The Lebesgue-Fubini Theorem (LF) 


(i) Product of measures. Let X and Y be two locally compact spaces, and ju 
two positive measures on X and Y. Like in Chap. V, the product measure v 
on X x Y is given by the formula 


aor) ff fanart = fancy [Fle uautu) 


for all functions f € L(X x Y). Checking that (1) is well-defined is the only 
issue since if that is the case then obviously, the result depends linearly on f 
and is > Oif f > 0. 

For given x, integration over Y does not raise any problems since the 
function fr: y f(x,y) is in L(Y). On the other hand, f vanishes outside a 
product K x H, where K C X and H CY are compact.?° The integral in y is 
thus zero for x ¢ K. It remains to show that the function z > f[ f(a, y)du(y) 
is continuous, and hence that, as x tends to some a € X, the function f,(y) = 
f(x,y) converges uniformly to fa(y) on Y, in other words that, for all r > 0, 
there is a neighbourhood U of a in X such that 


\f(x,y) — f(a,y)| <r for all re€U andall ye d. 


There are general theorems on the topic (uniform continuity in the classical 
case), but we may as well argue directly. As f is continuous, for all y, there is 
a neighbourhood of the point (a, y) in X x Y on which f is constant, up to r. 
This neighbourhood may be supposed to be the product of a neighbourhood 
U(a,y) of ain X and a neighbourhood V(y,a) of y in Y. As H is compact, 
it can be covered by finitely many sets V(y;,a). Then the set U = () U(a, y:) 
clearly answers the question. 
It goes without saying that the formula 


[ore [ tend) = | an f tenance), 


proved in Chap. V for particular cases, can be extended to the general case. 
One shows that functions f € L(X x Y) vanishing outside a compact set 
K x H are uniform limits of finite sums 5° kp)(x)hq(y), with functions kp, € 
L(X) and hg € L(Y) that may be assumed to vanish outside fixed compact 
neighbourhoods K’ and H’ of K and H: uniform continuity together with 
the existence of partitions of unity [Chap. IX, n° 17, (ii), lemma 4]. One may 
also apply the Stone-Weierstrass to the sums of functions k()h(y). We leave 
it to the reader to fill in the gaps in this “proof”. 


26 For example for K and H take projections of a compact set A C X x Y outside 
which f vanishes. 
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This done, the LF theorem aims, if not to characterize in a precise manner 
integrable functions f with respect to the product measure — it can be done 
if f is known in advance to be measurable with respect to v —, at least to 
generalize formula (1) to these function, as was done in Chap. V in very 
particular cases. 


(ii) The Lebesque-Fubini theorem. 


Theorem 19. Let X and Y be two locally compact spaces, X and 4 measures 
on X andY,v = \xp the product measure and f € L1(X xY,v) an integrable 
function with respect to v. Then: 

(i) the function y > f(a,y) ts p-integrable for almost all x € X, 

(ii) the function x ++ f f(x, y)duly), defined almost everywhere, is A- 
integrable, 

(iti) the next equality holds: 


02) wA= ff savancyduiy) = farce) [ se,v)auty). 
XxY X Y 
(iv) For any reasonable and measurable function f with respect to v,?" 


(10.2°) v* (fl) = / dd (a) / If(o,y)| du" (y); 


f ts integrable with respect to v if and only if 
(10.2") [xe [iewlae Wy) < +0. 


Obviously, results permuting the order of integration should also be men- 
tioned, in particular one should take not of the constantly used formula 


fae [rendu = fan | tenance, 


valid for all integrable functions with respect to the product measure. If f 
is measurable, it holds whenever the right hand side of (2’) — or else the 
similar expression obtained by switching the roles played by and ps — is 
finite. Finally, similar results clearly hold for all functions f belonging to an 
L” space: it suffices to apply the theorem to the function |f|?~1f. 

Exercise 1. Interpret the theorem when one or both spaces are discrete. 

The proof of the LF theorem comprises several parts. In what follows, the 
letters f and y will denote functions defined on Z and we shall write f,(y) 


for f(x,y). 


°” Recall that, generally speaking, one writes [ f(x)d\* (x) = A*(f) for all measures 
and positive functions. 
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For f € L(Z), the definition of v can then be written as 


v(f) = / ulfe)AX(22) 


(a) Let us start with a positive lsc function y on Z = X x Y and the set 
Lint(y) of f € L4(Z) such that f < y. By definition, 


v"(y) = sup v(f) = sup / w(fe)d(c). 


Lins (y) 


The functions f, are continuous, their set is an increasing philtre since 
Lint(y), and their upper envelope is yz. Thus (theorem 1) 


sup U( fr) = L" (Ye) 5 


which shows that the function 7 +> j1*(~,) is lsc. As the functions 4 +> pu( fr) 
are also continuous and form an increasing philtre, theorem 1 shows that 


ae / wlfe)dX(e) = / B"(@e)dr* (2). 
So the relation 
(10.3) v*(y) = ‘| u" (on)aM* (x) 


finally follows for all positive lsc functions on Z. 

(b) We next consider an arbitrary function f with values in [0, +00]. By 
definition, v*(f) is the infimum of the numbers v* (vy) for Isc functions y > f. 
Since this inequality implies u*(y,) > u*(f,) for all x, and hence by (3) that 


(> fu (ts\aX"(e), 
it follows that 
(10.4) v(f)> fe isar@) for all f >0 on Z. 


(c) Let us in particular suppose that f is the characteristic function of 
a null set N Cc Z. The function fz is then the characteristic function of the 
“vertical cut” N,. of N, i.e. the set of y such that (x,y) € N. The right hand 
side of (4) is zero, and so (n° 2, (iii), theorem 4) 


(10.5) p*(N,) =0 for almost all re X. 


(d) Now let f be an integrable function with values in C or [—oo, +00] (or 
a Banach space). Denoting by N; the norm L* with respect to the measure 
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v, there is (n° 4, (ii), theorem 7) a null set N C Z with respect to v and a 
series of functions f, € L(Z) such that 


S > Ni(fn) < +00 & flz = G2) forall ze Z—N. 


The series }> f, then converges in mean to f in L'(Z,v) and 


(10.6) v(t) = fe) =o / 1 (Fan) M2) 


However, 


(10.7) fw llfnal) Xe) =v (fal) = OM (In) < +00. 


Hence, by theorem 4, >> u(|fn,x|) < -+-oo outside a A-null set M’ Cc X, and 
the series fy. converges in L'(Y,) for all e € X — M’. 

But > f(z) = f(z) for all z © Z—N and, by (5), we know that u*(N,) = 
0 for all x ¢ M”, where M” is a null set with respect to \. Ifa ¢ M’UM"” = 
M, then the series S> fn.z(y) converges both p-almost everywhere to fz(y) 
and in L1(Y, 2). So, for all « ¢ M, (theorem 7) 


(10.8) foe L(Yip) & ufo) =>) ure) 


Finally, as 33 |t(fn,x)| < 35 uC fn,2|), theorem 6 on series shows that the 
function >> 4(fn,2) is A-integrable and that 


[Xen XO) = Tf wna) ae) = 0) 


by (8). As 2 W(fne) = Mfr) for 2 ¢ M by (8), the function x +> (fe), 
defined almost everywhere with respect to A, is A-integrable, with in addition 


(10.9) V(f) = i u(fe)dd(e) 


which proves propositions (ii) and (iii) of the statement. 

(e) To prove (iv), let f be a reasonable function with values in [0, +oo] and 
v-measurable. It is the simple limit of an increasing sequence of v-integrable 
functions f, > 0 converging everywhere (n° 9, corollary 3 of theorem 18). 
Theorem 2 of n° 2 then shows that 


v*(f) =limy* (fr) = lim (fr) 


and so, by proposition (iii) of the theorem, that 


(f) = lim i * (fae) Ma). 
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However, the functions 7 + w*(fr.), with values in [0,+00], form an in- 
creasing sequence like (f,,). Applying theorem 2 to » then leads to 


v*(f) = fee tim (fra) - 
Since the same theorem 2, now applied to ys, shows that 


lim pt (fn,x) =" (fe) 


(f= fu" (fs) ana), 


Integrability condition (2”) readily follows, qed. 

Exercise 2. Suppose that X is the set R equipped with the discrete topol- 
ogy, that Y = R is equipped with the usual topology, that A({a}) = 1 for all 
x € X and that ps is the Lebesgue measure. Show that 


u(y) = D f oe.ndy 


for any positive Isc function v.78 Let f(x,y) be the (unreasonable) function 
equal to 1 if y € Q and 0 otherwise. Show that f is measurable with respect 
to v and that for this function, the left hand side of (2’) is infinite and the 
right hand side zero. 


(iii) Additions to the LF theorem. Let us now present some consequences 
of the LF theorem and additions to it. 

Applying it to a characteristic function shows that: 

(C1) Let EC X x Y be an integrable set; then the set E, of y€ Y such 
that (2,y) € E is p-integrable for almost all x, the function «+> p(E,) is 
A-integrable and 


(10.10) 1B) = | w(B.) Aa), 


a result already obtained in Chap. 5, n° 33 by assuming EF to be compact. In 
particular, 


(10.11) v(E£) =0 w(E,) =0 for almost all x. 
By (2’), this equivalence continues to hold for all v-measurable sets EC 


XxyY. 


8 The sum of the right hand side series is, by definition, the supremum of its partial 
sums (unconditional convergence for a series extended to an uncountable set). 
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(C2) IfU CX and V CY are open, then 
v*(U x V) = A¥*(U) pr (V). 


Obvious by (3). 
(C3) If K c X and H CY are compact, then 


v(K x H) = \(K)u(H). 


Apply (2) to the characteristic function of the compact set K x H. 
(C4) If AC X and B CY are integrable, A x B is integrable and 


(10.12) v(A x B) = \A)u(B). 


If a product A x B is integrable and if neither A nor B is a null set, then 
both A and B are integrable. 
Indeed, for all € > 0, there are compact and open sets such that 


KcCACU, \(U — K) <e, 
HcBCcV, WV — H) <e. 


AsUxV-KxHcCcUx(V-—H)U(U-K)xV, 
v(U x V)—v(K x A) < XU) u(H — U) + XU — K)Wl(V), 


a quantity <r for sufficiently small «. Hence the first proposition. 

Conversely, let us suppose A x B = F to be integrable. Then, for all 
xz € X, E, = B or @ depending on whether x belongs to A or not. However, 
by (C1), the set of x for which E, is not integrable has measure zero. Hence, 
if A is not a null set, then B is integrable, qed. 

(C5) If N CX is a null set and if B CY is reasonable, then N x B has 
measure zero with respect to v. 

This is obvious by (C4) since B = NUL) K,,. The assumption on B always 
holds if « is countable at infinity; otherwise, N x Y may well not be a null 
set. 

Let us for example suppose that Y is an uncountable discrete space and 
that w(f) = >> f(y) for any function f € L(Y), ie. with finite support, 
whence obviously pu*(f) = >> f(y) for any function f > 0. If the product 
E = N x/Y is a null set, there is a lsc function y on X x Y which is 
everywhere positive, > 1 on N x Y and such that v*(y) < +00. But, by (3), 


vy=> / ola, yar" (a) , 


and this sum is finite only if the set of y such that f v(x, y)dA*(x) 4 0 is 
countable. Thus the set of x such that y(z,y) > 0 must have measure zero 
for uncountably many values of y. However, for all y, this set contains N and 
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is open since y est Isc. We, therefore, infer that N x Y may be a null set only 
if N is contained in an open null set, a condition which, by (C2), is anyhow 
sufficient. Hence if X = R is chosen to have the Lebesgue measure and Y to 
be as indicated, then N x Y is a null set only if N is empty. Measures of this 
type are not encountered in Nature, but that of mathematicians is not that 
of physicists. 

(C6) If f € L?(X, >) and g € L?(Y, pu), then the function 


fxg: (z,y)—> f(z)gly) 


is in L?(X x Y,v) and 


(10.13) If * ally = lllollglle 
(10.13") v(f xg) =X fu(g) for p= 1. 


Finite sums of such functions are everywhere dense in L?(X x Y). 
To show this, choose f,, € L(X) and g, € L(Y) such that 


YEN op (fa) = 00, > fala) = f(a) for re X—M, 
Np (Gn) < +00, Dignl(y)=g(y) foryeY—N, 


where M and WN are null sets. Then 


Np (fm X Gn) = Np (fm) Np (Gn) 5 


and so }> Nz(fm X gn) < +00. 


On the other hand, 


(10.14) Yo fm (x) gn (y) = F(x) 9(y) 


unless « € M or y € N, ie. outside (M x Y) U(X x NV). As f (resp. g) is in 
L?, there is a reasonable set A (resp. B) outside which the functions f and 
fim (esp. g and gp) are all zero (all unions of countable reasonable sets are 
reasonable), so that (14) holds outside (A x Y)U(X x B). We may suppose 
that Mc A and N c B. Hence relation (13) in fact holds outside the set 
(M x B)U(Ax N), so by property (C5) above, almost everywhere. Therefore 
the series }> fm X gn converges in mean to f x g (n° 4, theorem 6), giving 
the first result. 

To show that any h € L?(X x Y) can be approximated by functions such 
as >> f; x gi, with f; and g; in the L? spaces with respect to \ and p, h may 
be assumed to be continuous and zero outside K x H, where K C X and 
H CY are compact. The Stone-Weierstrass theorem then shows that h is 
the uniform limit on K x H of functions f; x g;, where all f; are defined and 
continuous on KC and all g; on H. 
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Extending these function by 0 outside K or H gives integrable functions 
on X or Y, and 


Ih Sof x gillp SVK x ENP Ff — DO fi x ill, 


where, the norm on the right hand side is the uniform norm on K x H, which 
may be made arbitrarily small, qed. (As in Chap. V, n° 30, theorem 30, one 
can also use partitions of unity). 

(C7) If f: X — P andg:Y — Q are measurable, then the function 


(f,g): (x,y) (f(z), 9(y)) is measurable. 
Let K C X and H CY be compact sets. Then there are decompositions 


K=Mu\|JKn, H=N | JF. 


where K,, and H,, are compact, M and N null sets, so that the restriction 
of f (resp. g) is continuous on each K,, (resp. H,,). Then, (f,g) is clearly 
continuous on the sets K, x H,, whose union, thanks to (C5), is the compact 
set K x H, up to a null set. As all compact subsets of X x Y are contained 
in such a product, the function (f,g) satisfies Lusin’s theorem, qed. 


Corollary. If f and g are complex-valued, the function f(x)g(y) is measur- 
able [and we recover (C6) by taking (2’) into account]. 

(C8) Let us suppose and yw to be reasonable and let f be a measurable 
map from X x Y to a topological space P. Then the function 


ar—> f(x,y) (resp. yr f(x,y) 


is measurable for almost all y (resp. x). 

Since \ and yp are reasonable, by statements (C3) and (C4) above, so is 
the product measure v. As was seen at the end of n° 10, then Z = NUU An, 
where JN is a null set and for each n, A, is an integrable set on which the 
restriction of f is continuous. For all « € X, let Ap, be the set of y such 
that (x,y) € Ay and let N, be the similar set relatively to N. The restriction 
of the function y + f(,y) to A,» is clearly continuous for all «. However, 
by (C1), An,z is integrable for almost all x, and w(N,) = 0 for almost all x. 
The exceptional sets occurring here depend on n, but there is a countably 
infinite number of them, so that their union M C X is a null set. As the 
relation Z = NUWAp, implies Y = N, UU An,x for all x, for all x ¢ M, 
Y is the union of a null set (namely N,,) and a sequence of integrable sets 
(namely A,,,) on which the restrictions of f, are continuous. The function 
fx is, therefore, measurable for « ¢ M, qed. 

When P is a Polish space (n° 11), for the function f, : yt f(x,y) to be 
measurable, f;'(U) needs to be measurable for all open sets U C P, which 
leads to another proof in this case. Exercise! 

Applying (C8) to the characteristic function of a measurable set A C 
X x Y shows that A, is measurable for almost all z. 
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(iv) The Fourier inversion formula. Let us consider two functions 
f,g € L*(R). Since their Fourier transforms are continuous and bounded, 
the functions f(x)g(a) and f()g(x) are integrable, and 


[feos nde = f f(a ya fof y)e(xy)dy. 


The exponential being continuous and bounded, the function f(x)g(y)e(xy) 
is integrable over R x R by (C6), and permuting the integrations gives 


(10.15) [feos ajax = f Frat 


As the Fourier transform maps the Schwartz space S to itself, g may be 
replaced with an arbitrary function h € S. By the Fourier inversion formula 
(Chap. VII, n° 30, theorem 26) in S, this substitution replaces g(y) with 


h(—y) ; and so 
/ f(a)h(a)de = i Fl-wyh(y)dy 


Let us also suppose that f is integrable. (15) can then be applied to the 
function f(—y), whose Fourier transform is 


| Fedecuay = F-2). 
/ ROMO re / enone 


for all h € S. The function f(x) —f (zx), which is the difference of an integrable 
function and a continuous one, is, therefore “orthogonal” to S, and hence 
to L(R) since every h € L(R) is the uniform limit of a sequence of functions 
hn € S vanishing outside a fixed compact. The Fourier inversion formula, 
namely 


So 


(10.16) 1) = f Fel e(xy)dy ae. if f and 7 ed: (R), 


now follows from the corollary of Lusin’s theorem [n° 8, (i)]. 
Exercise 3. Prove (16) assuming that f € L?(R) and f € L'(R). 


11 — A Topological Interlude??: Polish Spaces 


(i) Polish spaces. Metrizable and separable spaces have frequently occurred 
in previous sections. Clearly, the completion of such a space P with respect 


29 The results of this n° will not be used before n° 13. 


62 XI — Integration and Fourier Transform 


to a metric compatible with its topology, is again metrizable and separable (a 
sequence everywhere dense in P is everywhere dense in its completion) and, 
a function with values in P is measurable if and only if it is measurable as a 
function with values in the completion of P. 

From a classical viewpoint where measurability is defined using condition 
(FM) of n° 7, one may confine oneself to functions with values in spaces 
satisfying the following two conditions, involving in fact only the topology of 
P: 


(EP 1) the topology of P can be defined by a distance with respect to 
which P is complete, 


(EP 2) P isseparable. 


As was seen in n° 7, the second condition is equivalent to saying that P 
contains either an everywhere dense countable set, or else a countable family 
of open sets U,, such that any open subset of P is the union of the sets U;, 
contained in it. 

These are the Polish spaces®° of N. Bourbaki, Topologie générale, Chap. IX, 
§6. Their importance also stems from the fact that they are the only spaces 
in which non-trivial theorems about Borel sets can be proved. Apart from 
some exceptions, most results presented in this n° are not really essential for 
the rest of this chapter, but it is useful to know how far one can go without 
encountering unannounced pitfalls. 


Lemma 1. Eery metric space is Polish. 


First, by Cauchy’s criterion together with BW, it is complete with respect 
to a metric compatible with its topology. On the other hand, for all n, it can 
be covered by a finite number of open balls of radii 1/n. The centres of the 
balls thus obtained are everywhere dense in X, qed. 

Conversely, every compact space of countable type can be shown to be 
metrizable, and hence Polish. 


Lemma 2. A locally compact metric space X is separable if and only if it is 
countable at infinity. 


Indeed if X is the countable union of compact sets K,,, each K,, contains 
an everywhere dense countable subset D,,. This gives the result for X by 


3° An abbreviation that I had jokingly suggested to N. Bourbaki in 1949 after hav- 
ing learnt the subject from Casimir Kuratowski’s Topology (Warsaw Acad. of 
Sciences, 1933, in French, re-published in two volumes in the 1950s) and realized 
the contribution made by the Polish. I could have also called them “ Polono- 
Russian”. The joke, in fact not completely a joke, was taken seriously and since 
then all experts have adopted this strange terminology, generally without men- 
tioning N. Bourbaki except Kuratowski himself who, in a small book published 
in 1974 on the history of Polish mathematics, justifiably saw in it a tribute to the 
latter. Just like we make a distinction between metric and metrizable space, so 
should we make a distinction between Polish spaces (in which a complete metric 
is given) and Polishable ones (in which such a metric exists). 
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considering the union of all D,. Conversely, if (U,) is a countable basis for 
the topology on X, for all a € X, let us choose a compact neighbourhood 
V(a). Its interior being the union of sets U,, one of them contains a. Its 
closure, contained in V(a) is compact. As a is arbitrary, X is the union of 
the sets U,;, whose closure is compact. This gives countability at infinity. 

All locally compact separable and metrizable sets X will later be shown 
to be Polish. 

Many spaces encountered in classical analysis are Polish, but familiar 
spaces that are not saw can be found without venturing too far. For example, 
the space of continuous and bounded numerical functions on R equipped with 
the norm of uniform convergence, though being a complete metric space, is 
not separable. One would obtain a separable and hence a Polish space by 
confining ourselves to functions tending to finite limits as x tends to +oo or 
—oo. Indeed, these functions are just the continuous functions on the compact 
metrizable space?! [—o0, +00], the finished line of N. Bourbaki. Hence the 
result using the next exercise. 

Exercise 1. (a) Let X be a compact metric space, d a distance on X, 
(dy) a sequence everywhere dense in X and we set f(x) = d(x,an). Let A 
be the set in L(X) of polynomials in a finite number of functions f,, whose 
coefficients are complex. Show that A is everywhere dense in the Banach 
space L(X) (use the Stone-Weierstrass theorem). Replace A by the set Apo of 
polynomials in f,, whose coefficients are rational. Show that Ap is countable 
and everywhere dense in A, hence in L(X). (b) Let X be a locally compact 
metrizable space and countable at infinity (i.e. Polish, see further down). 
Show that there is a countable subset D of L(X) with the following property: 
for every compact set K C X, there is a compact neighbourhood H of K such 
that any f ¢ L(X,K) is the uniform limit of functions belonging to D and 
vanishing outside H [first show that D(X, K) is separable for all K]. Deduce 
that the L?(X; 4) spaces of measures on X are separable for p < +00. 

If a metric space X is separable, so is also every subspace Y of X since 
if (U,) is a basis for the topology on X, then the sets U, MY play the same 
role for the topology of Y. It already follows that every closed subset Y of 
a Polish space X is Polish since it is separable and complete with respect to 
every metric of X with respect to which X is Polish. As will be seen so are 
all Gs sets in X. Let us start with the simplest case: 


Lemma 3. Every open subspace P of a Polish space X is Polish. 
Let P = X — F, where F is closed, and we set 
f(x) = d(x, F) 


assuming that X is complete with respect to d. The relation f(x) 4 0 is 
equivalent to x € P. In the Cartesian product R x X, which is obviously 


31 To transform [—o0, +00] into a Polish space, choose a strictly increasing continu- 
ous function f(z) on R converging to —1 (resp. +1) as x tends to —co (resp. +00), 
write f(—oo) = —1, f(+o0) = +1, and use the metric d(x, y) = | f(x) — f(y)|. 
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Polish, the set Z of ordered pairs (u, 7) such u.f(x) = 1 is closed, and hence 
Polish. However, (u,2) ++ x is a continuous bijection from Z onto P; its 
inverse x ++ (1/f(x),x) is continuous since 1/f is continuous on P. As a 
result, P is homeomorphic to a Polish space, qed. 

The proof equally shows that every open subspace of a complete metric 
space is a complete metric space — obviously not with respect to the metric 
of X, but that of Z in R x X, namely 


(x,y) = d(x, y) + |1/d(x, F) — 1/d(y, F)| . 


If X,,..., X, are complete metric spaces, their Cartesian product, equipped 
for example with the distance 


d[(x;),(yi)] = S> di (Li, Yi) , 


is acomplete metric space. If the spaces X; are Polish, then so is their product. 

More generally, let (X,,), n € N, be a countable family of complete metric 
spaces and, for each n, choose a distance d, <1 in X,, with respect to which 
X,, is also complete.*? We denote a generic element u = u(n) of X = [] Xn 
as a function defined on N with values in the spaces X,,. The formula 


(11.1) d(u,v) = S°2-"d, [u(n), o(n)] 


defines a metric on X. As series (1) converges normally in X x X, the relation 
lim up = u with respect to the metric d is equivalent to 


(11.2) limu,(n) = u(n) for allneN 


(Chap. III, n° 13, theorem 17). Hence, if the spaces X,, are complete, then 
so is X. 

If the spaces X,, are separable, then so is X. Indeed, the definition of the 
distance shows that every ball centered at a € X contains an open set of the 
form [] En, where, for a finite set F’ of values for n, FE, is a ball centered at 
a(n) and where E,, = X,, for n ¢ F’. Choosing for all n, a basis (Un») for 
the topology of X,,, we get a basis for the topology of X by considering the 
products [] E,,, where E,, is one of the U,, for n € F and where E, = X, 
otherwise. Now, the results of Chap. I, n° 7 tell us that the set of these 
products is countable. A countable product of Polish spaces is, therefore, 
Polish. 

This is in particular the case of the set RN of functions or sequences 
u:N—>R equipped with distance (1). It is necessary to choose a distance 
< 1 on each component R, for example 


3° Tf d is a metric on a set E, the function d'(x,y) = d(z,y)/[1 + d(z,y)] is a 
metric < 1 defining the same topology and having the same Cauchy sequences 
as d. If E is complete with respect to d, it is equally so with respect to d’. Hence 
the existence of the distances dy. 
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d,,(u,v) = |u—v|/(1+ ju) . 


As in the general case, the topology of RN is just that of simple convergence: 
a sequence u,, € R% converges to some u € RN if, for all p, un(p) converges 


to u(p). 


These constructions enable us to justify what had been announced before 
lemma 3: 


Theorem 20. A subspace P of a Polish space X is Polish if and only if P 
is the intersection of a countable family of open subsets of X. 


Since a subspace of a separable space X is itself separable, we might as 
well prove a more general result: 


Theorem 20 bis. The topology of a subspace P of a complete metric space 
X is induced by a metric with respect to which P is complete if and only if 
PisaGs set in X. 


The proof is similar to that of lemma 3. Since a closed subspace of a 
metric space X is complete and is a Gs set in X, we may assume P to be 
everywhere dense in X: replace X by the closure of P. 

(a) Let us suppose that P is complete with respect to a distance d. For 
allz € X, let w(x) be the infimum of diameters*? of the (by assumption non- 
empty) sets VM P, where V is an arbitrary open neighbourhood of x in X. 
For all r > 0, V may be chosen so that the diameter of POV is < w(x) +r. 
As V is also an open neighbourhood of each of its points, w(y) < w(x) +r 
for all y € V. Therefore, the set G,, of x such that w(x) < 1/n is open in X. 
For « € P, the sets VN P are just the open neighbourhoods of « in P, which 
include all open balls. Hence w(x) = 0 and as a result, P C (]Gn. We show 
that P=()Gn. 

To do this, we consider some a € (|}G,,. There are neighbourhoods V,, of 
a in X such that the diameters of the sets V,, P are < 1/n. As any smaller 
neighbourhood is perforce suitable, X being metrizable, the sets V, can be 
assumed to be closed, decreasing and with intersection (| V;, = {a}. Choosing 
some un € V,NP for all n, for n > p,q, we see that the set V,, contains V, and 
V,, so that up, and u, are in V, 9 P. Hence d(up,u,) < 1/n for p,q > n. As 
P is complete with respect to d, (u,) converges to some u € P. But for each 
p = N, Up is in the closed set V,. So u € V,, for all n. Thus u € ()V;, = {a} 
and a € P, which proves that P = (|G, as expected. 

(b) Conversely, let us suppose that P = ()G,, where the sets G, = X —F, 
are open in X. Let d now denote a distance with respect to which X is 
complete. The relation x € P is then equivalent to 


d(a,F,) = fr(x) #0 for all n. 


33 The diameter of a non-empty set E C X is the supremum of numbers d(x, y) 
where x,y vary in E. Attention should be paid to the fact that the metric d of 
P is not that of X . 
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For all z € X, let f(x) € RN denote the sequence n+> f,(x). Like the maps 
fn, f is continuous since the relation limz, = x implies that lim f, (ap) = 
fn(x) for all n. Let Z be the subset of the Cartesian product RN x X, a 
complete metric space like R“ and_X, of ordered pairs (u, 2) such that 


(11.3) u(n)fr(#) =1 for all n. 


It is obviously closed, and hence a complete, metric space (Polish if so is X). 
The projection (u,x) > is clearly a bijection from Z onto P. Its inverse, 
which maps each « € P to the point (u,x) for which u(n) = 1/f,(x) is 
continuous for n, since if the sequence of points x, € P converges to some 
x € P, then the sequences (f,,(xp)) converge to non-zero limits f,,(z) . Finally, 
P is homeomorphic to Z, qed. 


Corollary 1. Let P a metrizable space. Suppose that P is complete with 
respect to some distance compatible with its topology. Then P is a G5 set in 
its completion with respect to any distance compatible with its topology. 


Obvious. 
The proof does not use separability. This result enables one to show a bit: 


Corollary 2. Every Polish space is homeomorphic to a Gs set in a compact 
Polish space. 


To prove this, let us consider the infinite-dimensional cube I‘, where 
I = [0,1]. It is the set of maps or sequences u : N —+ I equipped with the 
distance 


d(u,v) = pe 27" |Un — Un| 


which, as was seen above for the product RN, induces the topology of simple 
convergence. We show it is compact by using Bolzano-Weierstrass together 
with Cantor’s “diagonal argument ”:*4 if there is a sequence u(p) = (un(p)) in 
IN, the sequence u;(p) contains (BW for I) a subsequence uj (pPm,1) converging 
to a limit ui, the sequence u2(pm,i1) a subsequence u2(pm,2) converging to a 
limit ug, etc. The diagonal sequence (u(Pni.)) then converges to u = (un). 
To see this it suffices to show that 


lim Un (Pmym) = Un = lim Un (Pmn) 
m m 


for all n, which follows from the fact that, for given n, the sequence (Pm,m) 
for m > n is is a subsequence of the sequence (Pmjn)- 

This done, to construct a set Z in IN homeomorphic to a given Polish 
space P, one chooses a sequence (a,,) everywhere dense in P and a distance 


34 The following argument applies to all products of a countable family of com- 
pact metric spaces (see for example Dieudonné, (12.5.9)). A result without any 
countability or metrizability assumptions using a far more sophisticated version 
of BW (“ultrafilters”) can be found in N. Bourbaki’s Topologie générale. 
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d <1 with respect to which P est complete. Then the image Z of X under 
the map x ++ (d(an,x)) from X to IN answers the question. To see this, it 
suffices to show that in X, the relation lim x, = x is equivalent to 


limd (an, tp) =d(an,2) for all n. 


If this holds, then indeed, for all r > 0, there exists n such that d(an,x) <r. 
As d(@n, Zp) < r for large p, we deduce that d(x,, x) < 2r , whence lim z, = x. 
As P is a complete metric space, its image in JN is necessarily a G5 set, qed. 

If in particular a locally compact space P is metrizable and separable (i.e. 
countable at infinity: lemma 2), then its completion with respect to a metric 
compatible with its topology, being Polish, is homeomorphic to a subset of 
I, hence so is P itself. As P and I‘ are locally compact, P is the intersection 
of an open and of a closed subset of I’. But a closed subset in a metrizable 
space is a Gs set. Hence P is a Gs set in IX, and so: 


Corollary 3. Every locally compact metrizable space countable at infinity is 
Polish. 


(ii) Lsc functions on a locally compact Polish space. A good way to under- 
stand the quasi-necessity of introducing Polish spaces consists in analyzing 
the proof of the next result: 


Theorem 21. Let X be a locally compact Polish space. Every positive lsc 
function on X is the limit of an increasing sequence of continuous functions 
with compact support. 


It suffices to show that every positive lsc function y is the upper enve- 
lope of a countable family of functions f, € L(X) because the functions 


sup(fi1,.--, fn) converge to y. 
Let us consider the open sets 


(11.4) Un = (p(x) > p/n} 


for p > 0,n > 1. Denoting by Xpn the characteristic of the function of Upn, 
the functions Xpn, the functions 


(11.5) onl) = = > xpn() 


are Isc, and y = sup(Y»). Indeed, let x be a point of X. If p(x) < 1/n, then 
Xpn(x) = 0 for all p > 1. Hence yp (x) = 0 (if y(a) = 0) or 1 (if v(x) > 0), 
and so 


(11.6) Yn(z) < v(x) < Yn(x) + 1/n 


in this case. If g/n < p(x) < (¢+1)/n for some g > 1, the point x belongs to 
all the sets Up, such that 1 < p < q and to none of the following ones, so that 
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n(x) = g/n, leading again to (6). Finally, if g(x) = +00, then ypn(#) = 1 
for all p. Hence y,(x) = +oo = y(a). Therefore, relation (6) holds for all 
x € X and n> 1, whence y = sup(yp). 

On the other hand, if the theorem holds for a sequence of Isc functions, it 
clearly holds for their upper envelope and for their sum. Hence the proof re- 
duces to showing this for the functions yp, i.e. for the characteristic function 
of an open set UC X. 

Now, if an increasing sequence of functions f, € L1(X) converges 
to xv, clearly f,(a) > 1-—1/p for large n, for all « € U and p > 1. 
As the functions f, vanish outside U, U is the union of the compact sets 
Kom = {fn() > 1—1/p}. Hence to go on with the proof, one needs to show 
that all open subsets of X are countable at infinity.2° This is indeed the 
case since X being locally compact and Polish, so is U (lemma 1), which is, 
therefore, countable at infinity (lemma 2). 

The open set U is then the union of an increasing sequence of compact sets 
K,,. As any compact set has a compact neighbourhood in U,°° one may even 
assume that each K,,4; contains an open set U,+, containing K,,. Then, for 
each n, there exists a continuous function on U with values in [0,1], equal to 
1 on K,, and 0 on X —U,4,1 (Chap. IX, n° 17, lemma 1), hence on U — Ky,41. 
If fn(x) is replaced by 0 for all « ¢ U, the functions f, are in D(X) and their 
upper envelope is, as desired, the function xy, qed. 

Exercise 2. Let X be a metric space and U = X — F an open subset 
of X. Set fn(a) = n.inf (d(x, F),1/n]. Show that yu(x) = sup f(x) for all 
x. Deduce that every positive lsc function on a metric space is the limit of 
an increasing sequence of continuous functions on X. Does this lead to an 
alternative proof of theorem 21? 


(iii) Borel sets in a Polish space. Polish spaces have far stranger properties 
than the above ones. We are going to state some of them without proof in 
this n°, not because they will be immediately needed, but in order to show 
how quite simple theoretical constructions in set theory and general topology 
can lead to non-measurable sets, even in familiar spaces such as R. 

Exercise 3. Let B be the set in RN consisting of u = (un) for which lim up, 
exists. Show that B is a Borel set. 

Exercise 4. Let X be the set of real-valued continuous functions on J = 
(0, 1]. Equipped with the distance associated to uniform convergence, it is 
Polish (exercise 1). Let S be the unit ball of X and S% the set of sequences 
f = (fn) with f, € S, equipped with the distance 


d(f,9) =S Ifa - gall Joes 


35 X being countable at infinity is not sufficient for this. Any locally compact space, 
even uncountable at infinity, is an open subset of a compact set, for example, its 
Alexandrov compactification X = X U {oo}. 

36 Recall that a neighbourhood of a set E must contain an open set containing E. 
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Show that SN is Polish. Let B be the set of f € S% for which lim f(z) exists 
for all a € INQ. Show that B is a Borel set. Answer the same question for 
the set of f for which the sequence of functions f,, is increasing. 

The simplest example of an apparently strange Polish space is the set 


P=R-Q=()R-{6} 
EEQ 


of irrational numbers. It is a Gs set in R and even in the compact space 
[—o0, +00]. Equipped with the topology of R, P is, therefore, Polish. Here 
the sets F,, in the proof of theorem 20 bis are the sets {a}, a € Q. The space 
P is totally discontinuous: for all « € P, any of its neighbourhoods contains a 
neighbourhood which is both open and closed in P, namely the intersection 
of P and an interval centered at x and with rational endpoints. It is then 
obvious that P cannot be locally compact. 

This space has a literally miraculous “universal” property concerning 
Borel subsets: an uncountable subset B of a Polish set X is a Borel set if 
and only if it is the image of R — Q under a continuous and injective map. 
In particular, this property holds for X itself. Hence, if X and Y are Polish, 
any continous and injective map from X to Y transforms Borel subsets of X 
into Borel subsets of Y. 

The theory of Borel sets is connected to the more general theory of 
Suslin’s?” analytic sets. These are images of Polish spaces (or, more sim- 
ply of R— Q) under continuous maps that are not necessarily injective. Any 
union or intersection of countably analytic sets is analytic, but in a Polish 
space, the complement of analytic space A is analytic if and only if A is a 
Borel set. It can be shown that all analytic sets are measurable with respect 
to any Radon measure in a locally compact Polish set. 

The image of an analytic set under a continuous map is clearly also an- 
alytic, but the image of the complement of an analytic non-Borel set under 
such a map may well be neither analytic, nor the complement of an analytic 
set. Thus, the class of analytic sets leads to the definition of more and more 


37 Yakovlevich Suslin (1894-1920), son of “poor peasants” who, as stated by one 
of his biographers, brilliantly completed his secondary school studies thanks to 
the help of “rich people” from his village, was admitted to Moscow university in 
1913 where he worked, with some other future brilliant mathematicians, under 
the supervision of N. Lusin. Reading an article of 1905 where Lebesgue “proves” 
that, in R?, the projection of a Borel set is again a Borel set, Suslin noticed 
that the proof was wrong and published a counterexample in 1917 in a note to 
the Comptes-rendus of the Paris Académie des sciences (Lebesgue reacted with 
delight at his mistake: it made Science progress... ). This is practically Suslin’s 
only publication: lack of funds, health problems (seemingly tuberculosis) and 
hunger led him to return to his village, where he died of typhus. Lusin, who 
took over, remained for a long time one of the bosses of Soviet mathematics. 
In Bourbaki, any metrizable space which is the image of a Polish space under a 
surjective continuous map (resp. bijective and continuous) is called a Suslin (resp. 
Lusin) space. This terminology seems to have been adopted by all specialists. 
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complicated sets: complements of analytic sets, continuous images of these 
complements, complements of these continuous images, continuous images of 
complements of continuous images of complements of analytic sets, etc. In 
this way, we get the projective sets. In a locally compact space equipped with 
a measure they may be non-measurable contrary to analytic sets. 

These results as well as many others were obtained using classical tech- 
niques of set theory and general topology with much ingenuity and imagi- 
nation. The topic continues to be the subject of active research that often 
causes serious problems of mathematical logic.°® 


12 — Continuous Sums of Measures: Examples 


(i) Product measures. As in n° 10, let X and Y be two locally compact 
spaces, A and y positive measures on X and Y and v the product measure 
on Z= X x Y. For all x € X, the map 


(12.1) Mei f'— [fle nduty 


is a measure pt, on Z, and the function x +> j1,(f) is continuous with compact 
support, hence A-integrable, for all f € L(Z). The definition of the product 
measure can then be put in the form 


(12.2) v(f) = / po(f)dde), f €L(Z). 


If the space X is finite (resp. discrete), a sum (resp. series) of measures is 
thereby obtained. In the general case, the “discrete” sum is replaced by a 
“continuous” sum of measures. As will be seen, this situation occurs else- 
where. 


(ii) Measures induced by locally integrable densities. The most elementary 
analysis provides examples of measures induced by a “density” from the 
Lebesgue measure on R, for example the map 


fro [foster 


where f € L(R), or 


+00 
fro | f(a)a7 dx 


38 See Alexander S. Kechris, Classical Descriptive Set Theory (Springer, 1995) and 
his bibliography. For an introduction to the topic, see Chap. IX, §6, of N. Bour- 
baki, Topologie générale. The book of the Polish mathematician W. Sierpin- 
ski, Les ensembles projectifs et analytiques (Gauthier-Villars, 1952), as well as 
Topologie by Kuratowski are other references. 
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in the case of R*. In the general case of a measure A on a locally compact 
space X, a function j with values in C or in [—00, +00] will be said to [n° 5, 
(ii)] be locally integrable if fj is integrable for all f € L(X). This is for 
example the case if j € L” for an exponent p > 1 (corollary 2 of theorem 11: 
Holder). The map f +> (fj) is then a complex measure v (n° 5, lemma 5), 
the measure j(a)du(x) with density j with respect to yu, defined by 


(12.3) | teod@)= f 1@¥2).ar). 


A measure of this type is also called a measure with base pp, a N. Bourbaki 
terminology, or, in a more traditional terminology, an absolutely continu- 
ous measure with respect to w. We will see later how to characterize them 
(Lebesgue-Nikodym theorem). 

These measures can be written in similar way to (2). If dv = jdA and if 
the measure 


(12.4) Ma: f —+ §(a) f(x) 


proportional to the Dirac measure at the point x is associated to every x € X, 
then clearly 


(12.4”) pS ‘ naCantey aera Fe TONs 


In this case, functions 7 ++ u,(f) are integrable with respect to A but are 
not continuous if neither is j. 


(iii) Image of a measure under a map. Let us consider two locally compact 
spaces X and Z, a positive measure \ on X and a Lusin-measurable map 
p:X — Z. For all f € L(Z), the composite function fop is then measurable 
and bounded. Let us suppose it is integrable. This is for example the case if 
X is bounded. Then the formula 


(12.5) v(f) = / f [p(«)] a(x) 


defines a positive measure on Z, the image of A under p. Setting 


(12.6) Lo(f) = f (p(x) 


to be the Dirac measure at p(x), one recovers formula (2). 

A particularly simple case is that of a continuous and proper map 7p, i.e. 
such that p~'(K) is compact for any compact set K C Z. The function f op 
is then in L(X) for all f € L(Z). For X, one can for example take in R?, 
for Z, the space R, and for p, the projection (x,y) + x. The latter is proper 
if and only if, for every compact interval K C R, the set X MN (K x R) is 
compact. If » is the Lebesgue measure on X, the measure v is given by 
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[re )dv(a )= ff #@ )dedy . 


Hence, if m(a) denotes the (linear) Lebesgue measure of the compact set 
consisting of y € R such that (x,y) € X, then 


[ toda) = f Bom 


where integration is over R. 

Another important example: we consider a compact group G and the map 
(x,y) 4 xy from G x G to G. If A and yp are two positive measures on G, the 
image of the product measure is the convolution product  * u = v, defined 


by 
u(t) = ff fenarte)auty) 
for all f € L(G). If \ = €q and ys = €y are Dirac measures, then obviously 
Eq * Eb = Eagud- 


The generalization to the case of a locally compact group (Icg) will be given 
in n° 25. This how a group G equipped with a locally compact topology with 
respect to which the maps (a, y) + xy and 2 ++ x~! are continuous is called. 
These groups occur everywhere: discrete groups, R”, T”, closed subgroups of 
GL,,(k) where k is a locally compact field such as R, C, p-adic fields, etc. 


(iv) Quotient of an invariant measure.*° To give another example of rela- 
tion (2), we consider a locally compact space Z and a IcgG acting “on the 
left” on Z by a continuous map (g,2) +> gx satisfying obvious algebraic 
conditions. One can then define a quotient set G\Z whose elements are the 
orbits Ga of G, as well as a map p: Z+> G\Z taking each z to its class Ga. 
As in every quotient of a topological space by an equivalence relation, there 
is a natural topology on G\Z: a set U C G\Z is open if and only if p~'(U) 
is open in Z. As will be seen in n° 15, in “good cases”, the quotient space is 
locally compact. 

As G is locally compact, there is a measure on G which is invariant under 
right translations g++ ga, the (Haar measure*®), i.e. such that 


3° Do not look for proofs in this section, which will be detailed in n° 15. 

4° Such a measure always exists and is unique up to a constant factor, its image 
under g +> g~' also being an invariant measure under left translations g +> ag. A 
proof of this result can be found in Dieudonné, Eléments d’analyse, XIV.1, but 
as the proof, however ingenious, does not teach anything more than the theorem, 
it can be admitted without any inconvenience. To know the properties of Haar 
measures is far more important, especially everything that concerns convolution 
products (n° 15 and 25). 


§ 4. Lebesgue-Fubini’s Way 73 


(12.7) / v(ga)du(g) = | pla)du(g) 


for all y € L(G) and aé G. For f € L(Z), the function 


it i. f(g9z)du(g) 


on Z may then be considered. The function being integrated is in L(G) if the 
set of g such that gk M H #9 is supposed to be compact for every compact 
sets K,H C Z. Replacing z by az for some a € G replaces g by ga, hence 
does not change the integral. So function (7) is in fact a function on the space 
X = G\Z, given by 


(12.8) f(a) = i f(g2)dulg) if 2 = p(2). 


Since, for given x, the right hand side is a positive linear functional on L(Z), 
one may write 


(12.9) fo(a) = elf), 


where jz is a measure > 0 on Z, namely the image of w under g +> gz. 

As will be seen in n° 15, in “good cases”, f +> f@ is a surjective map 
from L(Z) to L(X) and even from L,(Z) onto L4(X). 

Then suppose we are given a measure v > 0 on Z, G-invariant, i.e. such 
that 


J faire = f fave) 
for all g € G. If du(g) = du(g—'), we show that 
f& =0 = v(f) =0. 


Hence the integral v(f) only depends on f@, and depends on it linearly, and 
as the map Li(Z) —> L+(X) is surjective, it follows there is a measure 
> 0o0n X such that*! 


(12.10) APS A(T?) 


for all f € L(Z). Taking account of (9), it is written 


(12.11) V(f) = pa f)aX(2) 


which again gives a continuous sum of measures. 


41 We have here a particularly clear example of the usefulness of defining a measure 
as a linear functional on functions. Adopting the traditional viewpoint of tribes 
or abstract measures would give rise to serious complications. 
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Choosing Z = R, G = Z acting on R by z+» z+ 7 and for v the Lebesgue 
measure, the quotient X = R/Z is just the group T of the theory of Fourier 
series, A is the measure dm(u) on T (Chap. VII, §1) and (11) is just the 
relation 


| f(t)dt = | dm(u) S > f(tt+n) (w= e?"") 
R T Z 
which is the cornerstone of the Poisson summation formula. 


13 — Integrable Functions with respect to a Continuous Sum 


(i) The case of lsc functions. As in the case of a product measure, the exam- 
ples in the previous n° give rise to the problem of characterizing if possible 
measurable and integrable functions with respect to the measure 


(13.1) (f) = i pin FAC). 


Hence let us consider two locally compact spaces X and Z, a measure 
’ > 0 on X anda map z+ pz from X to the set M+(Z) of positive 
measure on Z. (1) defines a measure if and only if — linearity and positivity 
— the following condition holds: 

(INT) for all f € L(Z), the function F;(v) = us(f) is integrable with 
respect to X. 

The first problem is then to compute the upper integral v*(w) of positive 
Isc function over Z. By definition, 


(132) —-v*(y) =supe(f) = sup i. ptn(f)AX(a2) = sup d (Fp) 
and 


(13.3) L(y) = sup Hr(f) = sup F(z) , 


where f varies in the set Line(y) of f € L4(Z) dominated by y. Now, the 
corresponding functions Fi, are positive and their set is an increasing philtre 
since, if h is an upper bound for f and g, F), is obviously an upper bound for 
F; and F,. By (3), the upper envelope of this philtre is the function Fy. But 
the functions fF’, have no reason to be continuous or even Isc. Hence without 
any additional assumptions, it is not possible to readily deduce that 


sup A (Fy) = A” (Fy) 
as we did when proving LF. We can only write that 
sup A (Fy) < »* (Fy) . 


Nonetheless, it is possible to obtain the equality in some simple cases. 
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(a) Suppose first that the function 7 + pz(f) is Isc for all f € L+(Z). 
This is the case in example (i) of the previous n° — they are even continuous 
—, in example (ii) if the density j(a) is lsc, and in example (iii) if the map 
p is continuous. Then, by (3), the function «+> ux(y) is lsc, and (2) shows 
(theorem 1)*? that 


(13.4) v*(y) = sup / pol f)dd(e) = / pi(y)ar*(e).. 


(8) If Z is Polish, we know (theorem 21) that y is the simple limit of an 
increasing sequence of functions f, € L,(Z). Theorem 2 of n° 2, applied first 
to v then to the measures jt, first shows that 


v*(p) = sup v (fa) = sup / pte (Fn) @X(2t) 
then that 


be (Y) = sup fe (fr) - 


Since, by condition (INT), the functions « +> j2(fn) are integrable with 
respect to A, the same theorem 2, now applied to A, enables us to pass to the 
limit under the { sign and again obtain relation (4). In practice, it is by far 
the most important case. 

(y) In the general case, let us impose an additional condition to the mea- 
SUITES [Ly 

(MVM) The family of functions 


Fry(a) =polf), fe LZ), 


is equimeasurable with respect to A. 

This means (n° 8, exercise 6) that for every compact set kK C X and for 
all r > 0, there is a compact set K’ C K such that AUK — K’) < r and on 
which all functions Fy are continuous. Under this assumption, the lemma of 
n° 8 justifies relation (4) if the set {y 4 0} is reasonable. This restriction 
cannot be removed from the general theorem (including the classical version 
of LF), but the lemma of n° 8 shows that, for want of anything better, 


(134) (x, ~) =sup / tia (Xe f) EM) = i 5 (%q¢9) €X" (2) 


always holds for every compact set K C X and even, passing to the limit, 
for every reasonable set. 

Condition (MVM) trivially holds in case (i) of the previous n°. In case 
(ii), F(a) = j(x) f(x), where j is locally integrable and hence measurable. If 


# Recall that, for us, the notation dX* (x) indicates an upper integral with respect 
to XA. 
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j is continuous on a compact set, so are the functions F’y. In case (iii) of the 
image of a measure under a map Pp, fl2(f) = f[p(x)], i.e. Fy = fop, so that 
these functions are continuous on all sets on which p is continuous. Therefore, 
if p satisfies (LUS), condition (MVM) is then satisfied. 

In what follows, we will express condition (MVM) by saying that > pi. 
is vaguely measurable. 


To justify this expression, we need to equip the space M(Z) of (complex) 
measures on Z with a topology such that condition (MVM) rephrases 
property (LUS). Vague topology is the one that is suitable: by identifying 
every measure ps on Z to the map f +> u(f) from L(Z) to C, the vague 
topology is that of simple convergence. In other words, a variable measure 
js converges vaguely to a limit jo if and only if u(f) tends to po(f) for all 
f € L(Z). There is an obvious analogy with the notion of convergence for 
Laurent Schwartz’s distributions*? (Chap. V, § 10), which incidentally is 
influenced by the vague topology used at the time by Henri Cartan in 
potential theory. Saying that x +> pz is continuous on a subset A of X 
then amounts to saying that, for all f € L(Z), the function «4 p2(f) 
is continuous on A: express that ju, converges vaguely to fa if x € A 
converges in A to a € A. Choosing a compact set for A and applying 
Lusin’s definition, one sees that condition (MVM) means precisely that 
x *+ fy is measurable with respect to the vague topology. By the way, 
note that it is not in general metrizable, even when Z is as simple a space 
as [0, 1]. 

Condition (MVM) holds if «+> pu, is vaguely continuous, i.e. if uz(f) 
is continuous for all f € L(Z). This is the case in example (i) of the 
previous n°, in example (ii) if the function j is continuous, and in example 
(iii) if p is continuous. Assumption (a) is then satisfied, and hence (4) 
holds without any restrictions on yp. 


(ii) The generalized Lebesgue-Fubini theorem. Though the results need to 
be rounded off in each particular case, the next theorem makes it possible to 
unify the situations described in the previous n°. 


Theorem 22. Let X and Z be locally compact spaces, 4, a family of positive 
measures on Z depending on a parameter x« € X, and X a positive measure 
on X. Suppose that condition (INT) holds and that this is also the case of 
one of the following conditions: 


(a) the function x +> p.(f) is lsc for all positive f € L(X) positive ; 
(8) Z is Polish; 


43 As in this case, the open subsets of M(Z) can be easily defined. For given 
boo € M(Z) and fi,..., fn € L(Z), let W (10; fi,.--, fn) be the set of p € M(Z) 
such that |u(fi) — uo(fi)| < 1 for 1 <i<n.Aset U C M(Z) is then open if and 
only if it is the union of sets of this type. See some additional fact in Dieudonné, 
XIIL4. 
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(y) the map «++ [ly is vaguely measurable with respect to X. 


The following results then hold. 
(i) The function 


(13.5) U(f) = / po(f)dde), fe L(Z), 


is a positive measure on Z. 
(ii) If f € L1(Z;v), then f is u,-integrable for almost all x € X, the 
function «> ps(f), defined for almost all x, is -integrable and 


(13.5" [towe =f ae f Ferduel). 


(iii) If f is measurable with respect to v and vanishes outside a reasonable 
set, then 


(13.5”) v* (\fl) = / oe ([f[) an*(a). 


The following arguments bear a strong resemblance, and with good reason, 
to those used to prove the Lebesgue-Fubini theorem. 

Proposition (i) in the statement is obvious.*+ 

To prove (ii), we start with the equality 


v*(y) = / pk (gd (2) , 


proved above for all positive Isc functions on Z in cases (a), (3) and, if 
v*(p) < +00, in case (y); we then imitate the proof of the classical LF 
theorem. Its part (b) and relation (4) lead to the inequality 


(13.6) vi(f)> / ye (f)d\* (x) 


which holds for every function f > 0 on Z, including in case (7¥): there is 
nothing to prove if v*(f) = +00, and otherwise it suffices to calculate v*(f) 
using lsc functions y dominating f and for which v*(py) < +00; in that case, 
(4) can be applied to them. For a characteristic function, (6) shows that 


(13.7) {v*(N) = 0} => {u3(N) =0 for almost all x} . 


Now, let f be a function on Z with values in C or [—o0, +00], and suppose 
it is integrable with respect to v. Denoting by N; the L' norm with respect 


44 Because measures have been defined as positive linear functionals on L(X). If 
the other possible definition had to be used (countably additive functions on 
Borel sets, for example), we would come across additional difficulties. . . 


78 XI — Integration and Fourier Transform 


to the measure v, there is (n° 4, corollary 1 of theorem 7) then a null set 
N Cc Z with respect to v and a series of functions f, € L(Z) such that 


So Ni(fn)<+0o0 & f(z)= 50 fn(z) for all ze Z—N. 


The series )> f,, converges in mean to f in L'(Z,v) and 


(13.8) Wf) =v =o / pto (Fn) @X(2) 


However, 
(13.9) > fu (\fal) d(x) = Sv (fal) = SD Nin) < +00. 


By theorem 6, S> fz (|fn|) < -++oo outside a A-null set M’ C X, the series fy, 
moreover, converging in L1(Z, jz) for all a € X — M’. 

But > fn(z) = f(z) for all z € Z—N and (7) tells us that ux(N) = 0 for 
all 2 ¢ M”, where M” is a \-null set. Hence, if x ¢ M’ UM” = M, a d-null 
set, then the series f,,(z) converges to f(z) both .-almost everywhere and 
in L'(Z, wu). Thus (theorem 7) f € L'(Z, ux) and 


(13.10) Half) = Sea) for c¢ M. 


As SO |Me(fn)| < oa (fal), relation (9) shows that the function 1.(fp) is 
A-integrable and that 


Sf tel fa)drle) = f ada). Hala) = 20) 


But Yo fal(fn) = He(f) for « € M by (10). The function « + pz(f), de 
fined almost everywhere with respect to X, is, therefore, A-integrable with, 
furthermore, 


(13.11) (f) = / pin FAC) , 


which proves point (ii) of the statement (including for functions with values 
in a Banach space). 

To prove (iii), let f be a reasonable and measurable function on Z with 
respect to v, with values in [0, +00]. Then f is the simple limit of an increas- 
ing sequence of v-integrable functions f, > 0 converging everywhere (n° 9, 
corollary 3 of theorem 18). Theorem 2 shows that 


v*(f) =limv* (fr) = lim (fn) 


and hence, by proposition (ii) proved above, that 
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v*(f) = tim fi Fu)AX* (2). 
The same theorem 2 applied to a fz, shows that 


lim 5 (fr) = ui(f) for alae xX. 


The sequence of functions x +> yw*(f,) being increasing, applying theorem 2 
once again this time to A, shows that 


tim f ustayan(a) = f us )a(e), 


whence (5”), which completes the proof. 
The necessary and sufficient condition for integrability should also be 
added, namely 


/ uo (fl) aX*(a) < +00 


if f is measurable and reasonable with respect to v. 

Finally, a p'” power v-integrable function also has this property with 
respect to fz for almost all x. Denoting by f and f, its classes in L?(Z;v) 
and L?(Z; Ux), 


(13.12) lll? = / I fell? aX(2). 


For p = 2, we compute the inner product in L?(Z;v) by 


(13.13) (flg) = / (felge) aX(2) 


a formula analogous to the one defining inner products on direct products of 
Hilbert spaces. But the analogy is misleading; it is already so in the case of a 
space X with two elements: the L? space of a sum ju; + /42 is not in general the 
direct sum of the L? spaces of the given measures (trivial counterexample: 
take 41 = 2). We will come back to this point in the context of the Lebesgue- 
Nikodym theorem. 


14 — Integrable Functions with respect to the Image of a Measure 


Theorem 19, i.e. the classical formulation of LF, is not exactly a simple re- 
wording of theorem 22. The latter asserts that, for almost all « € X, the 
function f is ,-integrable, whereas theorem 19 asserts that the function 
fely) = f(a, y) is u-integrable. To obtain proposition (i) — and hence (ii) and 
(iii) — of theorem 19, it is thus necessary to show that 


(14.1) FEL (We,Z) > fe € L(Y p) 
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and that, in this case, jiz(f) = u(fe). Similarly, (10.5’) reduces to formula 
(13.2’) of theorem 22 if 


(14.2) a (f) = WH (fe) 


for all reasonable functions f. All this may seem obvious since jz is obtained 
by transposing to the vertical 


Y(x) ={a}xY 


of x the measure yz given on Y or, equivalently, since pz is the image of 
under the map y+> (x,y) from Y to Z. It still needs to be proved. 

As an exercise, the reader will easily find a direct ad hoc proof. But a 
general result holding in the framework of example (iii) of n° 12 can also 
be proved.*° It will come in useful later in the very particular case where 
p: X + Z is continuous and proper. While somewhat simplifying the proof, 
this assumption does get rid of all difficulties. 


Theorem 23. Let X and Z be two locally compact spaces, X a positive 
measure on X and p a measurable map from X to Z. Suppose that, for all 
f € L(Z), the function f op is \-integrable. Let 


Uf)=Afop), feElL(Z), 


be the image measure of X under p. 
(i) A reasonable function f on Z is v-integrable if and only if f op is 
A-integrable ; then 


(14.3) U(f) =A(fop). 
(ii) A function f on Z is v-measurable if and only if fop is \-measurable. 
(iii) the equality 

(14.3’) u"(f) =A*(f op) 


holds for any reasonable (but not necessarily measurable) function f > 0, 
and for all f > 0 if the map p is continuous and proper. 


Let us first state two results which we will will need: 


Lemma 1. Let X and Z be two locally compact spaces and p a continuous 
proper map from X to Z. The image p(F’) of every closed set F C X is 
closed in Z. 


The restriction of p to F being proper, we may assume that X = F. 
Let z € Z be a closure point of p(X). Since every compact neighbourhood 
W of z has non-trivial intersection with p(X), the compact sets p~'(W) are 


45 See N. Bourbaki, Intégration, Chap. IV, §4, from which I got rid of the “essential 
upper integrals” (limits of integrals over compact sets). 
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non-empty. The intersection of two sets of this type being again of the same 
time, they have a common point x (Chap. V, §2, n° 6, corollary 1 of the 
Borel-Lebesgue theorem, which holds in all generality). As p(a) belongs to 
all compact neighbourhoods W of z, z = p(x) € p(X), qed. 


Lemma 2. Let X and Z be two locally compact spaces, p a continuous proper 
surjective map from X to Z and f a map from Z to a topological space. f is 
continuous if (and only if) f op is continuous. 


By lemma 1, p maps every closed subset of X onto a closed subset of Z. 
Hence for a set A C Z to be closed it is sufficient (and necessary since p 
is continuous) for p~'(A) to be closed. Hence V C Z is open if and only if 
p+(V) is open in X. By the way, this shows that Z is the quotient space of X 
by the equivalence relation p(x) = p(y). If f is a map from Z to a topological 
space FE and if U Cc X and V C Z are the inverse images of an open set W 
of E under f op and f, then U = p-1(V) and V = p(U). As a result, U is 
open if and only if so is V, qed. 


Let us come back to the family of measures 


As was seen before the statement of theorem 22, it satisfies condition (7) 
of theorem 22, and even condition (qa) if p is continuous. In all cases, v = 
J Had A(z). 

Proof of (i): necessity of the condition. If f is v-integrable, then theo- 
rem 22 shows that f is j1,-integrable for almost all x — not very surprising*® -, 
that the function u.(f) = f[p(x)], defined for almost all x, is A-integrable, 
and finally that v(f) = [ f[p(x)]d\(x), whence (3). 

If in particular f is the characteristic function of a v-integrable set EC Z, 
then f op is that of p-!(E£). If E C Z is v-integrable, then one deduces that 
p ‘(E) is A-integrable, and one gets 


(14.4) v(E) = X[p7*(B)] . 


If E is a null set, then so is p-+(£). 

Proof of (ii). Suppose that f op is \-measurable and let us show that so 
is f with respect to v. Let H C Z be compact. As stated above, A = p~'(H) 
is integrable. Since p and f op are measurable, there is a sequence of compact 
sets kK, C Aon which p and f op are continuous and such that M = A-U Ky, 
is a null set. We consider the compact sets H,, = p(K,,). As p is a continuous 
and surjective map from K,, to H,, and as f op is continuous on K,,, lemma 2 
shows that f is continuous on H,,. It remains to show that N = H —U Ay 
is a null set with respect to v. Now, 


p'(N)=A-|Jp\(Hn) CA-LJ Kn =M, 


46 but not totally trivial: it means that |f(«)| < oo for almost all «. 
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which is a null set. As N is a Borel set contained in H, it is v-integrable, and 
so by (4), v(N) = A[p~!(N)] < ACM) = 0. Thus f is measurable. 

Conversely let us suppose that f is v-measurable and that K C X is 
compact. K is the union of compact sets K,, on which p is continuous, up to 
a null set. To show that fop satisfies condition (LUS) on K, it suffices to show 
it for each K,,. In other words, p may be assumed to be continuous on kK. But 
then p(K) = H is compact, and so H = NU\) Hp, where N = H —\J A, is 
a null set and the sets H,, are compact ones on which f is continuous. The 
function f o p is continuous on K, = KM p~'(H,,). These are compact sets 
since they are closed in K. On the other hand, (4) applied to N shows that 
Kp (N) = M is a null set. As 


Kn p\(H) =p \(N)N{ |p! (An); 


K = MUUKz. Hence f op is measurable even if f is a function with values 
in a topological space. 

Proof of (i): sufficiency of the condition. We suppose that f o p is inte- 
grable, and hence measurable. By (ii), f is measurable. If f is reasonable, 
proposition (iii) of theorem 22 enables us to apply (3’). Hence, if f o p is 
integrable, then v*(f) < +00 and f is integrable. 

In particular, this shows that a reasonable set EC Z is integrable (resp. 
null) if and only if so is p~'(E), in which case (4) applies. 

Proof of (iii): case where p is continuous and proper. Let y be a Isc 
function on X and ~w the function on Z given by 


(14.5) v(2) = inf y(e). 


We show that w is Isc. Since p~'({z}) is compact for all z, there is a point 
in p-+({z}) where ¢ reaches its minimum over this compact set [Chap. V, 
n° 10, (vi)]. Hence for given m € R, ~(z) < m if and only if there exists x 
such that p(x) = z and v(x) < m. So putting 


F={y(t)<m}, G={¥(z)<m}, 


we get G = p(F). Since p is continuous and proper, p(F’) is closed (lemma 1), 
whence the result. 

This being so, let f be a positive function on Z. By (13.6), v*(f) > 
\*(f op), so that it suffices to prove that v*(f) < A*(y) for any Isc function 
yp > f op. But for the corresponding function (5), obviously wo p < wy. As 
fop<vwand sof(z) < v(x) if p(x) = z, we also get f < w. Hence both 


u(f) <u") and A*(pop) < A*(y) 


hold. 
The map p being continuous, assumption (a) of theorem 22 holds. As 
is Isc, by (13.4), v* (a) = A*(Wop), which gives (iii) in this particular case. It 
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will come in useful for the completion of the proof in the general case. The 
idea is the same as in the previous proof, but contains additional technical 
details. 
Proof of (iii): general case. We first note that (3’) is proposition (iii) of 
theorem 22 if f is measurable. But (3’) only assumes f to be reasonable. 
By (13.6), v*(f) > A*(f op), which, as in the preceding case, reduces the 
proof to showing that 


(14.6) v*(f) <A*(y~) for any Isc function y > fop. 


As f is reasonable, it suffices (theorem 2) to do this when f vanishes outside 
a compact set H C Z; f op is then zero outside the set A = p-!(H), which 
is integrable by the already proved proposition (ii). Since p~![p(A)] = A is 
measurable, (ii) proves that p(A) = B is measurable, and hence integrable 
since it is contained in H. 

Let us choose an increasing sequence of compact sets K,, C A on which 
p is continuous and such that M = A—\)K,, are a null sets, and consider 
the compact sets H,, = p(K,,) and the measurable set N = B —|J Hy. Since 
p \(N) CM, N is a null set by (4). Then B= NUU An. 

Let vy be a positive lsc function on X dominating fo p. Setting 


(14.7) Un(z)= inf y(x) for z€ A,, 
(14.7’) Yn(z) =+00 for ze B-H,, 
(14.7”) Yr(z) =0 for zEZ-B, 


let us show that 
(14.8) Un(z) => f(z) forall ze Z. 


If z € Hy, then v(x) > f[p(x)] = f(z) for all x € K, such that p(x) = z, 
whence ¢),(z) > f(z) by (7). If z € B— Hy, the same holds by (7’). Lastly, 
if z € Z — B, then both sides are zero by (7”). 

On the other hand, clearly, 


(14.9) p(t) >Ynop(x) on Kp. 

Finally, the sequence w,, is decreasing since K, C Ky4+1 for all n. 
Setting 7 = inf(¢%,), w(z) = f(z) by (8). Thus 

(14.10) y*(p) > v*(f). 


Let us show that v(x) > ¢op(a) ae. For x € K, = A-— M, this follows from 
(9). If a € M, it does not matter if it is false since M is a null set. Finally, if 
x € X —A, then p(x) € Z — B, whence wo p(x) = 0 by (7”). 
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This implies that 
(14.11) A*(y) = A“ (op). 


By (10) and (11), the proof of relation (6) and hence of proposition (iii) will 
follow if we show that 


(14.12) u*() = A*(wop). 


As w is reasonable by (7”), for this purpose, it suffices to check that ~ is 
measurable [theorem 22, (13.5”)] or, since = = inf(¢,,), that so are Wn. 

The function wv, being constant on the measurable sets B—H,, and Z—B, 
the proof reduces to verifying it on the compact set H, = p(K,,). Now, p is 
continuous (and, trivially, proper) on K,,, and the restriction of y to Ky, is 
Isc. Replacing X and Z by K,, and H,,, we are back in the particular case 
proved above: the restriction of the function w, to H,, is lsc, qed. 


15 — Invariant Measures under Group Actions 


We will now consider case (iv) of n° 12, but the topics covered in this n° 
go far beyond the framework of a simple application of the generalized LF 
theorem. It is rather a matter of presenting the abc of what André Weil called 
Integration in topological groups and its applications in a famous book which 
was like the Bible to me in the 1940s (Paris, Hermann, 1941). 


(i) Invariant measures on a group. On every lcg G, there is measure d,(g) 
invariant under right translations g +> ga, as well as a measure dju;(g) inva- 
riant under left translations g +> ag. Though the existence theorem of these 
Haar measures is essential for the general case, as mentioned above, it does 
not teach us anything beyond its simple statement. In practice, the prob- 
lem is the explicit calculation of these measures; this requires quite different 
methods, in particular for Lie groups often needing the Chap. IX results on 
differential forms.*” 

Exercise 1. Let G = GL,,(R) be the group of n x n invertible real matri- 
ces, open in M,,(R). Show that the measure det(g)~"dm(g), where m is the 
Lebesgue measure on M,,(R), is right and left invariant on G. Is it well-defined 
as a measure on M/,,(R)? 

In fact, the most useful result is that these invariant measures are unique 
up to constant factors. The method presented in Chap. IX, n° 10, lemma a 


47 A Lie group is equipped with the structure of a C®@ (and even analytic) man- 
ifold such that the map (x,y) 1 xzy~! is C™. For all g, the map x > gz is 
a diffeomorphism whose tangent map at e is an isomorphism from G’(e) onto 
G'(g). Choosing an alternating multilinear form of maximum degree w # 0 on 
G'(e) and denoting by w(g) the form on G’(g) it induces by x > gz, we get a 
differential form (also denoted w) of maximum degree and everywhere 4 0 on G. 
This shows that the manifold G is orientable and enables us to define a positive 
measure pt on G by p(f) = f fw. It is the left invariant measure. 
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shows this as easily as for R”. On the other hand, the map g ++ g~! obviously 
transforms every left invariant measure into a right invariant one. 

It is often troublesome to use the same letter ys to denote the Haar mea- 
sures of several different groups, and hardly more convenient to denote them 
by A, uw, v, etc. Just like dz denotes the Haar (i.e. Lebesgue) measure on 
R”, in his book, André Weil only used dg (or dh, dk, etc. if the group is 
written H, K, etc.) for the left invariant measure of G. He never used the 
right invariant measure since, to integrate a function f with respect to it, one 
computes [ f(g~')dg. This convention simplifies notations when only general 
groups are considered. Other authors prefer to distinguish between djg and 
d,g by setting d,g = dj(g~'). In his Chap. VII et VHI on the Haar measure 
and the convolution product, N. Bourbaki denotes by 6 — why not? — the 
left invariant measure. Hereinafter, I will follow Weil’s convention, even if it 
means coming back to more precise notations where required. The notation 
L(G) will refer to the measure dg, and so will any mention of integrable and 
measurable functions unless otherwise stated. 

The uniqueness of the measure dg up to a factor has consequences. If, for 
a given a € G, we consider the image of the left invariant measure dx under 
the right translation x > xa, we again get a left invariant measure because 
b(xa) = (bx)a. Hence there is a number Ag(a) = A(a) > 0 such that*® 


(15.1) A Aca )ae = ACa) / fade 


for all f € L(G) or more generally integrable. Then A(xy) = A(x) A(y) and 
the function A is clearly continuous. In many cases, A(x) = 1, especially if 
G is compact (Chap. IX, n° 10, lemma e). G is then said to be unimodular. 

The measure A(x)~‘dz is right invariant since, in short, A(ra)~'d(xa) = 
A(xa)~tA(a)dx = A(x)~tdx. The measure d(x~') is also right invariant 
since (va)~' = a~'a~1. Hence d(x~!) = cA(x)~'dx with a constant c > 0, 
and so, r++ x~! gives dr = cA(x).d(x~') and thus c= 1. As a result, 


(15.2) d(xa) = A(a)dz , da~' = A(a)~‘dzx = dpx. 


Since the second relation can be set out more explicitly as 


[ie ryae= f p@a@rae, 


it, in particular, shows that if G is unimodular, then the Haar measure is 

invariant under x a7}. 

48 Applying (1) to the characteristic function of a measure set X C G gives the 
m(Xa) = A(a)m(X), where dm(x) = dz. If, like Leibniz, we consider that the 
symbol dz denotes an “infinitesimal volume element” at point a, then dr.a = 
A(a).dx, which is more conveniently written as d(xa) = A(a)dx. Then Formula 
(1) is obtained by the variable change x +> xa in the first integral since it 
transforms f(ra~') into f(x) and dx into d(xa). The notation d(x~') used in 
(2) can be interpreted in a similar way. Compare with the change of variable 
formulas for multiple integrals (Chap. IX, n° 10). 
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Exercise 2. Let G be the group of matrices*® (x y|0 1) witha € R*,y ER. 
It acts on R by t') at+y. Show that its invariant measures are of the form 
j(x, y)d*xdy and calculate the A function of the group. 


(ii) Continuous linear representations. In all cases, the group G defines 
continuous linear operators on L?(G) spaces, whose study, for p = 1 and 
mostly for p = 2, is the foundation of harmonic analysis on G. For example, 
for all a € G, denoting by L(a) the operator transforming every f € L?(G) 
into the function 


(15.3) L(a)f:a-+ f (a-*z) : 


which is also in L?(G), defines an isometric operator (i.e. preserves the norm) 
on this Banach space, and for p = 2, even a unitary one,°? and L(ry) = 
L(a)L(y) for all x,y. An operator R(a) in L? defined by 


(15.3’) R(a)f :a-+ f(xa) 


can be associated to all a € G. Then R(ay) = R(x) R(y) and 


but the operators R(y) are isometric only if G is unimodular. The maps 
xr L(x) and z+ R(x), called left and right regular representations from G 
to the L? spaces, are not continuous as maps on the Banach space of bounded 
operators on L?(G), but the maps (2, f)  L(x)f and (a, f) 4 R(«)f from 
G x L? to L” are. To see this, first notice that 


|L(x)f — £ (20) foll, $ IE@)I-If — foll, + 2(@) fo — £ (@o) foll, - 


As (a, f) converges to (20, fo), the first term on the left hand side tends 
to 0 since ||Z(x)|| = 1. It, therefore, suffices, to analyze the second term, 
namely to check that the map 7 > L(x)f from G to L? is continuous for 
all f € L?. Checking it for everywhere dense functions f is in fact sufficient 
because, if f, converges to f, L(x) f, converges uniformly to L(x)f on G. 
However, if f € L(G), then this is immediate since if x tends to zo, thus stays 
in a compact neighbourhood V of zo, the function L(x) f vanishes outside 
the compact set VK, where K is the support of f and, f being continuous, 
converges uniformly to L(#o) f while remaining zero outside V K. This implies 
convergence in L?. Same arguments apply to R(x). 


4° To simplify typing, it is convenient to set 


(a bled) = é A 


5 . ae . . . 
5° i.e. bijective and isometric. See n° 19. 
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This situation can be generalized using the notion of a linear representa- 
tion (implied to be continuous) of G: It consists of an ordered pair?! (H, U) 
where H is a Banach space and U a map x +> U(x) to the group of continuous 
invertible operators of H, satisfying the following two conditions: 


(RL 1) U(ay) = U(x)U(y), where U(e) = 1 and U(a~!) = U(a)“1, 
(RL 2) the map «+ U(«)a from G to H is continuous for all a € H. 


Condition (RL 2) actually implies that (z,a) 1 U(x)a is continuous. This 
can be shown by writing as above that 


||U(x)a — U (20) aol] < ||U(@)|| - lla — aol] + || (x)ao — U (a0) all 


and by taking into account the fact that °? 


(15.4) sup ||U(«)|| < +00 
cok 


for any compact set kK C G. This enables one to show that the first term of 
the right hand side tends to 0 (for K take a neighbourhood of xo), the same 
holding for the second term by (RU 2). It is actually enough to check (RU 2) 
for vectors everywhere dense in H. 

If H is a Hilbert space®® and if all U(x) are unitary, (H,U) is said to be 
a unitary representation of G. In dimension 1, such a representation is just 
a continuous solution with absolute value 1 of the functional equation 


5! What I write as U(g) is usually written 1(g), following Harish-Chandra. In alge- 
bra, which (theoretically) does not involve topology, a linear representation of a 
group G is a homomorphism g +> U(g) from G to the group GL(£) of in general 
a finite-dimensional vector space F over a commutative field. 

Since continuous functions x ++ ||U(a)al| are bounded on every compact set for 
all a € H, it suffices to use a Banach theorem, a quick proof for which follows. 
We start by proving the famous Baire’s theorem: let H be a complete metric 
space and (Fp) a sequence of closed sets such that H = UJ F,. Then at least 
one of the sets F, contains an open subset of H. Otherwise, if B is a closed 
ball, the interior of B is not contained in any F,, and so for each p, contains a 
closed ball which does not intersect F;,. Hence, starting from an arbitrarily chosen 
closed ball Bo of radius 1, we can construct a decreasing sequence of closed balls 
By, of radius < 1/p, such that B,M Fp = @ for all p. As H is complete, there 
exists x € () Bn, a contradiction since x € F, for at least some p. (Exercise: 
the theorem continues to hold if H is replaced by some G5 set in H and the 
sets F, by their intersections with it). Banach-Steinhaus theorem: Let E be a 
set of continuous linear maps from a Banach space H to another. Suppose that 
sup ||Ta|| = p(a) < +00 for alla € H. Then sup||T'| < +00. Otherwise, there 
exist T, € E such that sup ||T;|| = oo. For all p, the set F, of x € H satisfying 
||Tnx|| < pllz|| for all n is closed, and as sup ||Tn2|| < +00 for all z,H = UF). 
Hence, one of the sets Fp contains a ball B(a,r). Then, for ||xz — al] <r, 


\|Tn (x — a] < [Tra] + ||Trall < pllal]| + pla) =e, 


a constant independent of n; whence sup ||T;|| < c/r, a contradiction. We also 
state the closed graph theorem: a linear map T :H1 —> Hz is continuous if and 
only if its graph G C Hi Xx He is closed. 

53 For elementary properties of Hilbert spaces see n° 19. 
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x(xy) = x(2)x(y) - 


This is called a character of the group G. If G is commutative, there are 
enough of them so that, like on T or R, one can construct a theory of Fourier 
transforms on G without any explicit calculations (§ 7). But if G is not com- 
mutative, the function 1 may well be the only solution, for example as in the 
case of SZ2(R) which does admit any non-trivial finite-dimensional unitary 
representations. Characters are then replaced by irreducible unitary represen- 
tations, i.e for which the only closed subsets of H stable under the operators 
U(«) are {0} and H. However, since 


(U(x)a|U(x)b) = (alb), 


if a closed vector subspace of H is invariant under operators U(x), then so 
its orthogonal subspace and conversely. Among other things, this property 
explain the importance of unitary representations. If H is finite-dimensional, 
in which case it contains a non-zero invariant subspace of minimal dimen- 
sion (possibly H itself), it then follows that H is a direct sum of minimal and 
pairwise orthogonal invariant subspaces, i.e. on which the representation is ir- 
reducible. As it is based on a simple dimension argument, the latter no longer 
holds in infinite dimension, though there are cases where it does (compact 
groups), but one can prove a similar but imperfect result by broadening the 
notion of a direct sum. In the case of a general lcg, one first shows (Gelfand 
and Raikov, 1943), using quite ingenious arguments from functional analysis 
(see end of n° 30), that there are many irreducible unitary representations,°4 
then that any unitary representation is a “continuous sum” or a “ direct inte- 
gral” of irreducible representations in the sense of n° 24. Chap. XII will give 
concrete non-trivial examples of infinite-dimensional irreducible representa- 
tions. If G is compact, the irreducible representations are finite-dimensional 
and for any unitary representation (H,U) of G, there is a decomposition of 
H into a Hilbert (“discrete”) direct sum of minimal invariant subspaces, in 
which the U(a) define irreducible representations (end of n° 29, (iii). This is 
not hard to prove for the regular representation of a linear group since in this 
case the existence of several minimal finite-dimensional invariant subspaces 
is obvious: 

Exercise 3. Let G be a compact subgroup of GL,,(R). (a) Show that 
polynomial functions in coefficients of g are everywhere dense®’ in L?(G) 
(use Stone-Weierstrass). (b) Let Hg be the subspace of H = L?(G) con- 
sisting of polynomials of total degree < d in coefficients of G. Show that it 


54 This means that for all « 4 e, there is an irreducible representation such that 
U(x) # 1. Gelfand and Raikov’s article being in Russian, it is probably better 
to look up Henri Cartan and Roger Godement, Analyse harmonique et théorie 
de la dualité dans les groupes abéliens localement compacts (Ann. Sc. de l’Ecole 
normale supérieure, 1947). 

°° Polynomial functions on a non-compact closed subgroup G of GIn(R) can never 
be in L?(G). In this case, the exercise shows that there are finite-dimensional 
linear representations of G, but nothing more. 
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is finite-dimensional and invariant under operators L(x) and R(a) defined 
in (3) and (3’) or as the saying goes, bi-invariant. (c) Show that every left 
invariant (resp. bi-invariant) finite-dimensional subspace M 4 {0} of L?(G) 
contains a minimal subspace of the same type and that M is the direct sum 
of pairwise orthogonal minimal left invariant (resp. bi-invariant) subspaces. 
(d) Show that L?(G) is a Hilbert direct sum of minimal left invariant (resp. 
bi-invariant) subspaces. (e) For G = T, rewrite the result in the language of 
the theory of Fourier series. 

Unlike the above, the next exercise shows an intractable situation. 

Exercise 4 (von Neumann, 1943). Let G be a discrete group, so that the 
measures or Dirac functions 


éx(y)=1 ify=a, =0 ifyFu 


form an orthonormal basis for L?(G). Show that L(a)ey = €ry. Let A be a 
continuous operator in L?(G) such that AL(x) = L(«)A for all x. Show that 


Af@)= >< aay") f@) 


for all f € L?(G) where a = Ae, € L?(G). Show that AL(x) = L(x)A for all x 
if and only if the function a satisfies a(zyx—!) = a(y) for all x and y. Suppose 
that, for all y 4 e, the set of conjugates zyx! of y is infinite. Show that A is 
a scalar and deduce that in this case, L?(G) does not contain any non-trivial 
closed bi-invariant subspace (observe that the orthogonal projection operator 
onto such a subspace commutes with all LZ and R. This also mean that the 
unitary representation (x,y) + L(x)R(y) from Gx G to L?(G) is irreducible. 
[in this case, the quotient of the group SL2(Z) by its centre {1,—1} is, like 
all “arithmetically defined” groups such as SL,,(Z), the group of matrices 
with integer entries preserving an (indefinite) quadratic form with integer 
coefficients, etc.] 


(iii) Quotient of a space by a group. Let Z be a locally compact space and 
G a lcg acting on the left (or the right) on Z by a continuous map (g, 2) +> gx 
(or ag) satisfying obvious algebraic conditions. For A C G and B C Z, let 
AB be the set of products gz, where g € A and z € B. For B = {z}, the 
set GB, written simply Gz, is the orbit of z. gB is defined likewise by taking 
A= {g}-. 

One can then define a quotient set X = G\Z (or Z/G if G acts on the 
right), amap p: Z +4 G\Z taking each z to its orbit Gz, and a topology on 
X with respect to which the set U C G\Z is open if and only if p~'(U) is 
open in Z. For simplicity’s sake, it is often convenient to use the notation 


p(z)=2. 


The continuous map p is open, i.e. transforms every open subset W of Z 
into an open subset U of X since p~'[p(W)] =U gW is a union of open sets. 
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Hence, if X is separated, p maps every compact neighbourhood of z € Z onto 
a compact neighbourhood of p(a) in X. As a result, X is locally compact if 
it is separated. 


Lemma 1. Let Z be a separated topological space and R C Z x Z the graph*® 
of an equivalence relation on Z such that the map p: Z++ Z/R is open. Z/R 
is separated if and only if R is closed in Z x Z. 


Let us set Z’ = Z/R. The map (2, y) + (p(x), p(y)) from Z x Z to Z' x Z' 
is continuous. However, the set R of ordered pairs such that p(2) = p(y) is 
the inverse image of the diagonal of Z’ x Z’, which is closed if and only if Z 
is separated (exercise!) . Hence the necessity of the condition follows without 
any assumption on p. Conversely, if R is closed and if p(x) 4 p(y), the ordered 
pair (x,y) is exterior to R, hence there are neighbourhoods U and V of « 
and y in Z such that U x VM R= @. Then the images p(U) and p(V) are 
disjoint open neighbourhoods of p(x) and p(y) in Z’, qed. 


Lemma 2. Let G be a group acting on a separated topological space Z. The 
quotient space G\Z is separated if and only if the following condition holds: 
for every pair of points c,c! € Z such that Gc 4 Gc’, there are neighbourhoods 
U and U' ofc andc! in Z such that 


(15.5) gUNU'=@ forall geG. 


For any compact set K C Z, the set GK is then closed. If Z is Polish, then 
GB is a Borel set for any open or closed set BC Z. 


We denote by R the equivalence relation defined by G. As was seen above, 
the map p is open. Let (c,c’) be a closure point of R. Any neighbourhood of 
(c,c’) contains a set U x U’, where U and U’ are neighbourhoods of c and c’ 
in Z. Saying that U x U’ intersects R non-trivially means that there is some 
z€U, 2’ €U' and g €G such that z’ = gz, i.e. some g such that?’ gU#U’. 
The statement R is closed is expressed by saying that if this condition holds 
for all U and U’, then (c,c’) € R, i.e. Gc = Gc’. This gives condition (10). 

To prove the second proposition of the lemma, suppose first that B is 
compact. Then GB = p~'[p(B)]. As p(B) is compact and hence closed G\Z 
being separated, GB is closed, whether or not Z is Polish. If it is, then it 
is metrizable and countable at infinity, and as was seen in n° 10, so are all 
its locally compact subspaces. Hence, if B is the countable union of compact 
sets, for example if B is closed or open, then GB is the countable union of 
closed sets, and so is a Borel set,°® qed. 


56 Tt is the set of ordered pairs (x, y) satisfying the given relation. 

°7 Generally speaking, one writes E#F instead of ENF # 2. 

°8 One may wonder whether the result continues to hold for all Borel sets B. In 
fact, it is only possible to sat that p(B) is analytic (n° 10), and hence that so is 
p'[p(B)| = GB, so that p(B) is measurable with respect to any measure on Z. 
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Lemma 2 does not assume that G is equipped with a topology. Suppose 
that that G is locally compact and that the map (g, z) > gz is continuous. 
R is the image of X = Z x G under the continuous map 


mw: (2,9) +--+ (2,92) 


to Y = Zx Z. If G is locally compact, so are X and Y. On the other hand, the 
graph of the equivalence relation defined by G being 1(Z x G), it is closed 
if 7 is proper (n° 14, lemma 1), i.e. if the inverse image of any compact 
set kK C Z x Z is compact. As it is closed, it suffices to express this for a 
product B x A, where A and B are compact. Then 7~!(K) = B x C, where 
C = C(A, B) is the set of g such that gA#B. As a result: 


Lemma 3. Let G be a locally compact set acting on a locally compact space 
Z. The quotient space G\Z is locally compact if the following condition is 
satisfied: 


(GOP): for all compact sets A,B C Z, the set C(A,B) of g € G such 
that gA#B is compact. 


G is then said to act properly on Z. All closed subgroups of G act properly 
on Z. 

Exercise 5. Suppose that G and Z are metrizable so as to be able to 
use BW. Show by a direct argument that condition (GOP) implies (5) for 
sufficiently small U and U’ (consider sequences gn, Zn and z}, such that 2/, = 
Gn2n, lim Zn, = ¢, lim z/, = c’). 


(iv) Quotient of an invariant measure. This preliminary stage completed, 
let G be a lcg acting properly (on the left) on a locally compact space Z. For 
f € L(Z,K) and z € Z, the function g > f(gz) vanishes outside the set of 
g such that gz € K. As it is compact by (GOP), like in n° 12, (iv) one can 
define a function 


(15.6) Zh | flg2)arg = [is dg 


using Weil’s notation for the left invariant measure dg. This function is G- 
invariant since replacing z by az amounts to replacing g by ga. This leads to 
a function on X = G\Z given by 


(15.7) f%(4) = / FGdeae 


where recall that 2 = p(z). If K is the support of f, the integral vanishes 
outside GK, so that f% vanishes outside the compact set p(K). Finally, it is 
more or less obvious that function (7) is continuous, and thus 


(15.8) fe L(Z) => fle L(X). 
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Lemma 4. The map f +> f% from L4(Z) to L4(X) is surjective. 


First note that any compact set K C X is the image under p of a compact 
subset of Z. For all x = p(z) and all compact neighbourhoods W of z, the 
image p(W) is indeed a compact neighbourhood of «. By BL, K can be 
covered by finitely many such sets p(W;). As H = JW; satisfies p(H) > K, 
the compact set HM p~!(K) answers the question. 

This being so, first note®® that if some f € L4(Z) is > 0 on a com- 
pact set H Cc Z, then f° > 0 on p(H). Hence, for all yg € Ly(X) with 
support p(H), there exists 7% € L4(X) such that y = f%y. The function 
h(z) = f(z)v[p(z)] = f(z) v(Z) also being in L,(Z), it is possible to compute 


y= | How wl p(gz)| Nag = f F(o2)012) jd-g=u v2) | tg2e9= 
= $(2)f°(2) = 92), 


whence the lemma. It obviously also applies to L(Z) and L(X). 
We now suppose that there is a G-invariant measure on Z, i.e. such that 


(15.9) jr (gz)dv(z )= f fe )dv(z 


We will also need functions 
(15.7" fale) = [ tg:\ag= | 1192) Ac(o)arg. 


For all a € G, one readily gets 


(15.10) fg (a-*z) = Ac(a) fe(z)- 


One can show as above that the map f +> fae from L+(Z) to the set 
L4(X; Ag) of continuous, positive solutions with compact support mod G 
of (10) is surjective. The distinction between f© and fg is of interest only if 
G is not unimodular, but that it is precisely what often happens in practice. 


Lemma 5. For f € L(Z), the relation fg =0 implies v(f) = 0. 


Let A be the support of f and B that of a function h € L(Z). The relation 
h(z)f(gz) 4 0 requires z € B and gz € A, hence g € C(B, A), a compact set 
by (GOP). The function (z, g) —+ h(z)f(gz) is, therefore, in L(Z x G). The 
most elementary version of LF then shows that 


59 Every non-empty open set U has > 0 measure with respect to the left invariant 
measure of G. Otherwise, all the cosets aU would be null sets, and hence compact 
subsets of G (apply BW). 


§ 4. Lebesgue-Fubini’s Way 93 


Choosing h in such a way that the function h& equals 1 on the support A 
of f, we find v(f) = v(fch), qed. 

Since lemma 5 shows that the integral v(f) only depends on the func- 
tion fg, the analogue of lemma 4 for the map f +— f@ proves that 


(15.11) V(f) =A(fa) , 


where A is a well-defined positive linear functional on L(X; Ag). It is the 
quotient pseudo-measure of v by G. So, using the integral notation for A, 


(15.11) [t@ee = [aw | reas. 


In what precedes, starting from an invariant measure v on Z, we inferred 
a pseudo-measure \ on X. We could as well have started from A and defined v 
by (10) since fg € L(X; Ag). Clearly this would give an invariant measure on 
Z. Invariant measures on Z thus correspond bijectively to genuine measures 
on X if G is unimodular. 

To apply theorem 22 to the measure v in this case, we set 


(15.12) mene : Hendy: eae). 


This gives for each « € X = G\Z, a positive measure pz on Z. It is the 
image of dg under the (continuous and proper) map g +— gz. Relation (11) 
then becomes 


(15.13) (f) = / pio (FAC). 


Here the situation is particularly simple since (12) shows that «+> p.(f) is 
continuous for all f € L(Z). Assumption (a) of theorem 22, therefore, holds. 
It follows that if a function f on Z is v-integrable, then f is 4.,-integrable for 
almost all x € X, the function «+ ji,(f) is \-integrable and the equivalent 
formulas (12) and (13) continue to hold. Moreover, theorem 23, (i), shows 
that f € L1 (1; Z) if and only if the function g ++ f(gz) is dg-integrable, in 
which case (12) continues to hold. Hence finally, relation (13) can be written 
in form (11) in all cases. This assumes that Z is countable at infinity and 
that G is unimodular. If G is not unimodular, there is an analogous result, 
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but before that integration theory needs to be rewritten for linear functionals 
on L(X; Ag) — an excellent way to spend rainy days. 

Exercise 5. Suppose that Z is a unimodular lcg, that dv(z) = dz and that 
G is a closed subgroup of Z acting by (g,z) +> gz, which gives a positive 
linear functional \ on the space L(G\Z, Ag) of continuous solutions with 
compact support of f(g~'z) = Ac@(g) f(z). Suppose there is a closed subset 
H Cc Z such that (g,h) + gh is a homeomorphism on Z, which enables us 
to identify L(G\Z, Ac) with L(H) by restricting ourselves to H. Show that 


[fea = ff ratdigacn for all f € L(Z). 


(v) An example: the orthogonal group on R". For Z take the space R”, 
for G the compact group O,,(R) of orthogonal matrices (g’g = 1) acting in an 
obvious way on R” and for v the Lebesgue measure dz on R”, which is clearly 
G-invariant [Chap. IX, n° 10, (i)]. For 2’, 2” € R”, there exists g € G such 
that 2” = gz’ if and only if ||z’|| = ||z’’||, where this is the standard Euclidean 
norm. Here the quotient G\Z is R*_, the map p being just p(z) = ||z||. Hence, 
letting dg denote the invariant measure of G normalized by ~(G) = 1, there 
is a unique measure \ on R* such that 


@dz= f ™ ant) [ toes. 


Denoting by e; the first vector of the canonical basis of R”, any point z can be 
written z = t.ue; for some u € G and t = ||z|| = 2, and so f(gz) = f(t.gue1). 
As p is invariant, by theorem 22, the previous formula becomes 


Rv 


+00 
(15.14) fleyaz = f an(t) [| F(tger) dy 


for all f € L‘(R"). This being so, let us apply (14) to the function 
f(az),where a € R4. The left hand side is multiplied by a~”. On the right, 
t is replaced by at. Therefore the measure must satisfy 


ann f r(nantt) = [ F(atyartt), 
whence 
dX(t) = cyt" d*t, 
with a constant c,, > 0. To determine the latter, we may choose the function 
f(z) = exp [-alz||"] . 


The left hand side of (14) can be calculated by LF and equals 1” = 1 
(Chap. VII, n° 28 for n = 1). On the right, f(t.ge,) = f(te.), so that the 
integral in g equals f(te,) = exp(—zt?). Thus the right hand side equals 
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+00 1 
of t” exp (—7t?) d*t = 5m PT (n/2)en . 
0 


and so ¢, = 2n"/?/I'(n/2). Note that t I’(n/2) is readily computed if n is 
even and reduces to I'(1/2) = 1'/? if n is odd. Finally, relation (13) becomes 


(15.15) flejdz= 2 [eae | pegerra 
. z)dz = ——— t"d*t t.ge . 
Rn P(n/2) Jo pene cee 
For n = 1, ['(n/2) = w'/?, whence c, = 2. Here, the group G reduces to two 
maps «++ x and x++ —a, the measure p being induced by assigning weight 
1/2 to each of these two elements of G. Hence, the value of the integral over 
G is 1/2[ f(t) + f(—t)], t’d*t = dt and a known formula is recovered. . . 

For n = 2, in which case R” can be identified to C, cg = 27, the group G 
is simply T and, keeping the notation of Chap. VII, §1, 


+00 
a f(x)dz = an f tat [F [t.e(u)] dm(w) . 


Replacing e(u) = exp(27iu) by exp(iu), replaces integration over T by inte- 
gration over [0,1] and the invariant measure dm(u) by du/2a. The previous 
formula reduces to one enabling us to compute the integrals in polar coordi- 
nates. For n = 3, (3/2) = F'(1/2 +1) = V/7/2, whence c3 = 47 and 


+00 
= 2 
i f(x)dx = an f t at ff (tger)ag. 


As will be seen a bit later, (15) is often written in a slightly different and 
more familiar form. 


All this is very simple because the group G is compact. The question 
would become far more complicated if, instead of the subgroup of GL,,(R) 
leaving the quadratic form )> z? invariant, where the z; are the canonical 
coordinates of some z € R”, one considered the orthogonal group of 
a indefinite quadratic form, i.e. reducible to the form q(z) = )> «2? 
with positive and negative signs €;. In R*, the quadratic form «? + y? + 
2? — ct? of physicists falls in this case, G being the Lorentz group. One 
can then show that the orbits (Witt’s theorem®) are the hyperboloids 
q(z) = Cte, but the group no longer acts properly [even outside the null 
set of “isotropic” vectors, i.e. such that q(z) = 0]. To start with, the 
stabilizer G, of a point z is not even always compact.®! Hence, less brutal 


°° See for example my Cours d’Algébre, exercises 19 and 20, pp. 645-646. 

®1 Tf @ is a non-isotropic vector, R” is the direct sum of Ra and the subspace E(a) 
of orthogonal vectors to a (with respect to q). The stabilizer of a in G is thus 
isomorphic to the orthogonal group with respect to q in E(a). As a result, it is 
compact only if the restriction of g to E(a) is positive or negative definite. For 
the Lorentz form, any non-isotropic vector x is of the form Gei if g(x) > 0 and 
Gea if q(x) <0, up to a factor > 0, where (e;) is the canonical basis of R*. The 
stabilizer if x is compact in the second case, but not in the first. In fact, G acts 
properly on the open set q(x) < 0, but not on the open set g(x) > 0. 
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methods than formulas (6) and (11), which are no longer well-defined, 
need to be used. So, the analytic theory of quadratic forms with integer 
coefficients, which aims, as a first approximation, to find the number of 
integer solutions of q(x) = r for a given integer r, become much more 
complicated. 


(vi) The case of homogeneous spaces. Let us consider the extremely im- 
portant case of a lcgG and a closed subgroup K of G acting on G either 
on the left by (k,g) ++ kg, or on the right by (k,g) > gk. The correspond- 
ing quotient spaces, written K\G and G/K, are by definition homogeneous 
spaces . In the former case, their elements are the cosets Kg, in the latter 
one gK. In this case, condition (GOP) always hold. Indeed, as kg = g’, i.e. 
k = g'g"', C(A,B) = KN BA“, where BA~!, the image of A x B under 
the continuous map (g,g’) ++ g’g!, is compact . Hence so is C(A, B) because 
of the essential assumption that K is closed. As a result, every homogeneous 
space is locally compact. 

On the other hand, any closed subgroup I’ of G, including G itself, acts on 
the left on G/K by the formula y(gk) = (yg)K . This enables us to defined 
new quotient spaces ['\G/K; their elements are the double cosets [gK CG, 
i.e. the sets of products ygk, where y € I’ and k € K. These are also the 
orbits of "x K acting on G by (y,k)g = ygk—!. To express (GOP), note that 
here C(A, B) is the set of (y, k) such that yAk~!#B, i.e. such that yA#Bk. 
By far the most important case is that of a compact subgroup K. Since the 
previous relation implies that ~ belongs to the compact set BK A™!, the set 
of y which satisfy it is then contained in a compact subset M of I’, and 
so are the corresponding k. Thus C(A, B) is contained in a compact subset 
of I x K, and the closure of C(A,B) remains to be shown. But if some 
(Yn, kn) € C(A, B) converge to a limit (7,k), and if one chooses a, € A and 
bn € B such that ynan = bnkn, taking a subsequence if necessary, (a,) may 
be assumed to converge to some a € A and the b, to some 6 € B. Then 
ya = bk, whence (y,k) € C(A, B) and the result. In conclusion: 


Theorem 24. Let G be a locally compact group, K a compact subgroup of 
G and TI a closed subgroup of G. Then I acts properly on the homogeneous 
space G/K and the quotient space I'\G/K is locally compact. 


The general formula (11) applies to any homogeneous space G'/K provided 
kK is unimodular. As we make K act on the right on G, we choose v to be 
the measure d,g of G, invariant under g++ gk. The measure J is determined 
by 


(15.16) [ Oe he PO I f(gk)dk, 


where g = gk. The effects of the operations of G on X are easily calculated. 
Denoting by yu, the right invariant measure of G, (16) becomes 
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wolf) = f £€2)ane). 


Replacing f(g) by f(a~!g) with a € G replaces f*(z) by f*(a7!z) and the 
right hand side of (16) by f f*(a~tz)d\(z). On the other hand, the left hand 
side of (16) is replaced by 


[fed a= Acta) | fo)tg= Acta f (dr), 
where Ag is with respect to G. As f* is arbitrary in L(Z), 


(15.17) a (az) dX(z) = Acta) [ f(2)antz) 


for all f € L(Z). Therefore, the transformation formula is the same as that 
for the measure d,g (case where K = {e}) and could be written 


dX(az) = Ag(a)dX(z) . 


In particular, if the group G is unimodular, the measure X on G/K is G- 
invariant for all closed subgroup K of G. 

With different assumptions on G and K, we get some other analogous 
results, and even a general result without any assumptions whatsoever. See 
N. Bourbaki’s chapter on the Haar measure. 

Exercise 6. Let G be a unimodular Icg, K and H two closed subgroups of 
G such that the map (h,k) + hk from H x K to G is a homeomorphism.® 


Show that 
[ sota= ff somainar. 


(Identify G/K with H, set out explicitly the action of H on G/K and use 
the invariance of the quotient measure). 


These calculations enable us to put (15) in its classical form. Points of 
type ge1, where g € O,(R) = G, are just those of the unit sphere S,,_; of R”. 
Since the stabilizer of the vector e; in G,, is obviously the orthogonal group 
On-1(R) = Gp_1 and since the map g ++ ge; is constant on the cosets gG,-1, 
the homogeneous space G',,/G,,_1 can be identified with the sphere. Thus the 
invariant measure of G,, defines a measure o on it which is invariant under 
the groups G,, and of total mass 1 [apply (16) by choosing the normalized 
invariant measures of G,, and G,_1]. Then relation (15) becomes 


Qqn/2 +00 
(15.18) f(a)dx = ao! rat f f(t.u)do(u). 
R” D(n/2) Jo Sn—1 
° Example: G = GIn(R), K = On(R), and for H take the subgroup of triangular 
matrices whose diagonal entries are > 0. 
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But users do not work with measures o of total mass 1, but with “the surface 
element” d¥7(u) of the unit sphere to obtain the simpler formula®? 


+00 
a f(a)dx -/ td if f(t.u)dd(w) . 


n-1 


As the measure » is invariant under rotations, and hence proportional to 
a, in effect we are multiplying o by 2x"/?/I'(n/2). We thus discovered that 
in R? (resp. R*), the length (resp. surface) in the usual sense of the unit 
sphere is equal to 27 (resp. 47). One could also calculate the volume of the 
ball ||a|| < R. It suffices to apply the previous relation to its characteristic 
function, which reduces the calculation to integrating t”~'dt from 0 to R. 
One would thus recover 7R?, 47. R?/3, etc. 


(vii) The case of discrete groups. The case of a discrete group is particu- 
larly simple and, moreover, important . 


Lemma 6. Let I’ be a discrete group acting on a locally compact space Z. 
The quotient space ['\Z is locally compact, if the following condition holds: 


(GPD) for all compact sets A, B C Z, the set of y¢ I such that yA#B 
is finite. 


If z,z' are two points of Z and U, U' are sufficiently small neighbourhoods 
of z and 2’, then 


(15.19) yUHU! <=> yz = 2’. 


The first proposition is clear. If ['z 4 Pz’, yYUNU’ is empty for all y for 
sufficiently small U and U’ (lemma 2). So (19) follows in this case. If [z = 
Iz’, let V and V’ be compact neighbourhoods of z and z’ . The set C' of y such 
that yV#V’ is finite by (GPD). Let 71, ..., 7p» be the elements of C such that 
yz # 2. As Z is separated, there are neighbourhoods W; C V and W/ c V’ 
of z and z’ such that 7,W; W/ = @. We set U =()W; and U’ = ()\ WY. If 
yU#U', then yV#V’, and so 7 € C. Thus either yz = 2’ or else y = 7; for 
some i. The second case does not occur since YUNU! C yWiNW = 2, qed. 

Under the assumptions of lemma 6, I" is said to act properly discontinu- 
ously on Z, a terminology inherited from the theory of automorphic functions, 
as well as the notation I’, the letter G being set aside for “continuous” groups 
like SZ2(R). 

Some simple consequences follow from condition (GPD). First of all, the 
orbits Iz are discrete and closed in Z. The second claim is clear even if I” 


®3 Arguments following Leibniz consist in calculating the volume of the portion 
of space comprised between the spheres of radius t and ¢ + dt and in the inte- 
rior of the cone with base the infinitesimal surface element dX)(u) of the unit 
sphere. “Obviously”, ¢”~'dtd(u) = t"d*tdX(u). The formula thus follows by 
integration. 
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is not discrete: [’A is closed for any compact set A C Z (lemma 2). On the 
other hand, (19) for z = z’ shows that [zNU = {z} for all sufficiently small 
neighbourhood of z, hence the first claim. 

By (GPD), for all z € Z, the set of y such that yz = z is clearly a 
finite subgroup I, of I’. It is the stabilizer of z in I’. Hence, for any open 
neighbourhood V of z, the set 


U=()w 


vel 


is an open neighbourhood of z, obviously stable under the y € I, . For this 
neighbourhood, (19) becomes 


(15.20) yU#U yz= 2 yU =U. 


To express (11) in the case of a discrete group I’, one may suppose that 


frond =Dae) 


for all functions with compact (i.e. finite) support in I’. Then 


nt (2) = lye), 


and (11) becomes 
(15.21) E Flejde(e)= fan ay eee 


For Z = Rand G = Z acting by (z,n) + z+n, this relation is the cornerstone 
of the Poisson summation formula. Theorem 22 shows that if f is v-integrable, 
then the series )> f(yz) converges unconditionally for almost all z € I'\Z, 
its sum is r-integrable and relation (21) remains valid. 


The general notion of a quotient measure does not seem to have appeared 
in the literature before d’André Weil’s book, which only dealt with homoge- 
neous spaces. He already defined the quotient measure in the way we have 
done, thus giving an example of the simplifications due to the definition of 
measures as linear functionals. He was the first to use it in this context at 
a time when his partisans were very rare. The general case, which is an 
easy extension of the case studied by Weil, is hardly mentioned before the 
N. Bourbaki’s chapter dedicated to the Haar measure, but it was obviously 
known, at least implicitly, by some other others, especially Gelfand and his 
school even if, before 1950, they confined themselves to particular cases such 
as SL,(C). In fact, they were the first to publish the earliest good methods 
to calculate Haar measures by exploiting the existence of subgroups (see ex- 
ercise 6). At the time, some authors were also studying quotient measures 
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for one-parameter groups (G = R) acting improperly on compact spaces, in 
the framework of ergodic theory — a much more difficult question. Chap. V of 
Bourbaki defines such measures for general “measurable” equivalence rela- 
tions. 

Around 1880, the notion of a quotient space by a properly discontin- 
uous discrete group appears, very vaguely of course, in the work of Henri 
Poincaré, Felix Klein and others, in the framework of the classical theory of 
automorphic forms since it is essential in the definition of the corresponding 
Riemann surfaces (Chap. XII). But these authors and their successors, in- 
cluding Carl Ludwig Siegel much later than 1940, never wrote relation (21) in 
the “natural” form that we are now familiar with: they replaced the quotient 
space by a fundamental domain of I’. 

This is how any measurable set F C Z such that, for any function f € 
LYP\Z;d), 


(15.22) fle)dr(e) = fF ple] ave) 
r\Z F 

is called (we still hope for something much better). With this definition, (21) 

becomes 


(15.23) [1 )dv(z de), f(yz)dv(z =D f seoawe) 


ye oe 


for any function f € L1(Z;v). The most obvious case is obtained by making 
Z act on R. One can take F' = [0,1] and thus recover the equality 


te f(z)dz=)— 7 f(ztn)dz 


—oo 


Relation (23) holds if 


(15.24) Z| [ar 


and if, moreover, the sets yF are pairwise disjoint. But this at the very least 
supposes that I" acts without fired points, i.e. that [', = {e} for all z. This 
is the case if one makes I’ act on G since then yg = g implies y = e. It is 
rarely the case in quotients G/K as we will see in the context of modular 
functions.®* In fact, (23) remains valid if one only assumes that 


°4 The general framework of the theory of automorphic forms consists in consider- 
ing a space Z = G/K, where K is compact and G unimodular, and a discrete 
subgroup I" of G. One then studies functions f(z) I’-invariant or functions sat- 
isfying a slightly less simple condition. All cases reduce to vector functions on 
G satisfying f(ygk) = p(k)~' f(g), where p is a finite-dimensional unitary rep- 
resentation of K. By (9) and (21), integration over I’\Z reduces to integration 
over I’\G. Then, if we so wish, we can use a fundamental domain of I in G. See 
Chap. XII for the case G = S'L2(R). 
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(15.25) v(FNyF)=0 forall y#e. 


If this condition holds, YF N7/F = 7(F Ny~1'7’'F) is a null set whenever 
7 #7 because of the invariance of the measure. Since in practice a discrete 
group is always countable,®° (23) then follows from the general lemma: 


Lemma 7. Let X be a locally compact space, a positive measure on X 
and (F,,) a countable family of measurable sets in X. Assume that X is the 
union of sets F,, up to a null set, and that F, 1 Fy is a null set whenever 
p#q. A measurable function f is integrable if and only if 


a x)| du(x) < +00. 


Then 
(15.26) [se )du(a =of f(x) dua 


Indeed, let xn be the characteristic function of F;,. The value taken by 
the function 


2) = > xn(e) 


at x € Fp, is the number of indices q such that x € F,, and so equals 1 in 


R-URnk. 
DAY 


Since the union of F, 9 Fy is a null set, x(x) = 1 almost everywhere on 
F,, hence almost everywhere on X. Hence, for given f, one sets fn = fxn, 
|f| = >o |fn| ae., and as (fr) is the integral of f over F;,, the lemma follows 
from theorem 6. 

The existence of such sets Ff’ remains to be shown when I is discrete, 
an essential assumption to ensure that I’ is countable and apply lemma 6. 
Apart from purely measure theoretic arguments, Poincaré uses a method 
applicable to all cases encountered in the theory of automorphic functions 
of one or many variables. G/K is then a C® manifold, the group G acts 
through diffeomorphisms and in G/K there is a G- invariant Riemannian 
ds*. This gives an indefinitely differentiable distance function d(z, 2’) such 
that d(gz, gz’) = d(z, z’) and with respect to which the balls d(a, z) < r are 
compact submanifolds. The invariant measure v is defined by a G-invariant 
differentiable form of maximal degree, so that (n° 16, exercise 2) any subva- 
riety of G/K has measure zero as in R”. For all 7 £ e, the relation yz = z 
defines a subvariety of G/K, hence a null set if, as is natural, ones assumes 


°5 Every discrete subgroup of a locally compact group countable at infinity is count- 
able since its intersection with any compact set is finite. 
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that the relation gz = z implies g = e for all z. Choosing once and for all some 
w € Z outside the union of this countable family of null sets, we see that, 
for given z, there are only finitely many points z’ € Iz in a ball d(w, z’) <r 
since the orbit of z is discrete. Hence there is at least some z’ € I’z for which 
d(w, 2’) is minimal. So, denoting by F' the set of z such that 


d(w,z) <d(w,yz) forall yer, 


Z = UF. The set F is closed since, for given y, the previous relation 
defines a closed set. To be able to apply lemma 7, we still need to know that 
v(FNYF) = 0 for all y # e. But for all z € FNYF, y~'z € F, and so d(w, z) = 
d(w,y~+z) = d(yw, z). In the cases considered, any equation d(a, z) = d(b, z), 
where a # b, defines a submanifold of measure zero; whence the result since 
w was chosen in such a way that yw # w for all y 4 e. Poincaré in fact proved 
much more. Dieudonné transformed all this into exercises (Chap. XXII, 3, 
exercise 15), without as usual giving any references.©° 


°6 Perhaps C.L. Siegel, Topics in Complex Function Theory, Vol. II (Wiley, 1971). 
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16 — Measures With Respect To a Base Measure A: Integrable 
Functions 


Let X be a locally compact Polish space, A a positive measure on X and j 
a locally integrable function with respect to A [n° 5, (ii)]. By definition, the 
function jf is integrable for f € L(X), whence a measure 


(16.1) v(f) = / f(a)i(a)dd(x) 


on X. When j > 0, the measure dv = jdy is positive, which gives rise to the 
problem of characterizing integrable functions with respect to it. Formula (1) 
irresistibly suggests the answer: a function f is integrable with respect to jd\ 
if and only if so is the function fj with respect to A. Then, 


(16.2) i, f(«).i(a)dX(x) = / f(a)i(w).dX(2). 


The similarity between (2) and the associativity formula of multiplication 
does not, however, make the result obvious, except perhaps for a physi- 
cist identifying d\(x) to an infinitesimal volume element and the measure 
j(x)dA(a) to a mass or electric charge distribution with density j(x) and cal- 
culating in Leibniz’s manner. One could also invoke the latter’s theory of 
pre-established harmony [Chap. IX, n° 12, (ii)]. For mathematicians, results 
explain the notation adopted. 


Theorem 25. Let X be a locally compact Polish space, X a positive measure 
on X andj > 0 a locally \-integrable function. A function f with values in C 
or [—00, +00] is integrable with respect to the measure dv = jd if and only 
if fj is A-integrable. Relation (2) then holds. 

A complex-valued function j' is locally v-integrable if and only if jj is 
A-integrable. The measure with density j’ with respect to v is then identical 
to the measure with density j'j with respect to X: 


(16.3) i f(«).i'()dv(e) = / f(x)()i(2).dX(2) 


for all f € L(X). 


The proof is rather long and comprises various parts. 
(a) As observed at the end of section (ii) of n° 12, it suffices to set Z = X 
and 


(16.4) elf) = J(a) f(a) for all f € L(X) 


© Springer International Publishing Switzerland 2015 103 
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in order to put vy into the form 


(16.5) v(f) = / po (f)dX(2) 


Then theorem 22 shows that, if a complex-valued functions f is v-integrable, 
then f is j.-integrable (i.e. finite at x) for almost all x, the function x +> 
Lx (f) = f(x) is A-integrable, and finally formula (5), i.e. (2), remains valid. 

(b) But theorem 25 also affirms that it suffices that fj € L1(X,A) for f 
to be in L'(X,v). If the function f is v-measurable, then® 


(16.6) v* (\fl) = / |f(x)| j(2).dA*(@) 


by (13.5”). Hence in this case, the condition is sufficient. Thus the proof re- 
duces to showing that if fj ¢ L1(X,.), then f is v-measurable. We might as 
well prove a general relation between v-measurable and A-measurable func- 
tions: 


Theorem 26. Let X be a locally compact Polish space, X a positive measure 
on X, j a locally r-integrable positive function and dy = jdX the measure 
with density j with respect to . Let f be a map from X to a topological space 
Y, S the set of x for which j(x) £0, and fs a map with values in Y, equal 
to f on S and constant on X — S. f is v-measurable if and only if fs is 
A-measurable. 


Lemma 1. For all N CX, 


(16.7) v*(N) =0 <> *(NNS) =0. 


Indeed if the first relation holds, then the function f = y, is v-integrable 
and formula (5) applies. Thus f(x)j(z) = 0 outside a A-null set and relation 
(7) holds. 

Conversely let us suppose that it holds. For any integer p > 1, let xp be 
the characteristic function of the set S, of « € X where j(x) > 1/p. Since 
(x) =p xp(a) for a, 


(16.8) u(f) = (fi) = P*A(FXp) 


for all positive f € L(X). Taking the upper limit in (8) of an increasing 
sequence of functions, one gets the same result for all lsc functions (n° 11, 
(ii), theorem 21), then, taking infima, one sees that 


(16.9) v*(F)>p'd*(Fxp) forall F>0. 


°7 We will see at the end of this n° that in fact relation (6) holds for all f, but this 
does not follow directly from theorem 22. 
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If F = xy, for some N C X, it follows that the relation v*(N) = 0 implies 
A(N 1 S,) = 0 for all p, which proves the lemma since S = L) Sp. 
In particular 


(16.10) u*(X —S)=0. 


Thus the v-measurability of a function f only depends on its restriction to 
S. This is a very natural result... Also 


(16.11) \M(N) =0 => v*(N) =0 


since the first relation trivially implies \*(N.N S) = 0. 

Having settled this point, let us return to theorem 26. Two functions equal 
almost everywhere being simultaneously measurable or non-measurable, by 
(10) f is y-measurable if and only if so is fg. On the other hand, as S is 
A-measurable, if f is A-measurable, then clearly so is fs as well (the converse 
is obvious false if X — S$ is not A-null). 

This being so, suppose that fs is A-measurable. For every compact set 
K Cc X, (Lusin) K = NUUK, with A(V) = 0 and compact sets K,, on 
which the restrictions of f are continuous. Since v(N) = 0 by (11), fs and 
hence f are v-measurable. 

To prove the converse, one can once again confine oneself to the case 
where f is constant outside S. As f is v-measurable, for every compact set 
K Cc X, there is a sequence of compact sets K,, such that N = K —-UK,, is 
v-null and f is continuous on K,. Writing K = (NN S)U(N-NOS)UR, 
one sees that \(N 7S) = 0 since p(N) = 0, and that f is continuous on Ky, 
and on N— NS (as it is constant). Thus f is \-measurable, qed. 

An immediate consequence of the previous theorem is that all A-measurable 
functions are v-measurable, for if f is \-measurable, so is fs since S' is A- 
measurable. 

(c) We can now return to the characterization of v-integrable functions 
f. As was seen by using (6), the proof reduces to showing that if fj is \- 
measurable, then f v-measurable. Given theorem 26, proving the next result 
is sufficient: 


Lemma 2. Let f be a function with values in C or [—co, +00]. Then f is 
measurable with respect to dv = jdX if and only if fj is A-measurable. 


The function j’ (a) equal to 1/j(z) on S and 0 on X—S, being \-measurable 
like j and constant outside $, is y-measurable by theorem 26. So, if fj is A- 
measurable, and hence v-measurable, then so is fjj’, a function equal to f on 
S and 0 elsewhere. Thus f is y-measurable by the same theorem. Conversely, 
if f is y-measurable, then the function fjj’, equal to f on S and 0 elsewhere, 
is A-measurable. Hence so is also fjj’.j = fj, proving the lemma. 

(d) The the proof of theorem 25 will be complete once we have char- 
acterized the locally integrable functions j’(x) with respect to the measure 
dv = jdX. This is readily done since we need to show that, for all f € L(X), 
the function fj’ is v-integrable, i.e. that the function fj’j is A-integrable, etc. 
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A convenient form of the previous result is as follows: 
(16.12) j jd =j ja, 
which makes it obvious, but this is misleading. 


Relation (6) was obtained using theorem 22 and assuming f to be v- 
measurable. Actually, it holds without this assumption. In what follows, we 
will suppose that f takes its values in [0,-++oo] and we will not forget conven- 
tions about +00: 


(16.13) a.too=+oo ifa<0<+o0, 0.+00=0. 
The general relation (13.6) first shows that inequality 
u"(f) 2 (Fi) 


holds in all cases. Thus it suffices to prove the opposite inequality, i.e. by 
definition of the right hand side, that 


(16.14) v*(f) <A*(w) for any Isc function y > fj. 
To prove this we consider the function 
g(x) = p(a)/j(x) if j(a) #0, g(a) =+0c0 otherwise. 


It is A-measurable like y and j. gj = y => fj and hence g > f on S,. This is 
also the case on X — S since then g(a) = +oo. Hence 


u"(f) <v*(g). 
But as g is A-measurable, (6) shows that 
v*(g) = A*(9j) - 


However, gj equals y on S$ and 0 on X — S since j(x) = 0. So gj < » every- 
where. As a result, 


r* (gj) < A*(~), 


which leads to the expected result. 

Functions j > 0 for which jdyu is a bounded measure can be characterized. 
This means that the function 1 must be integrable and hence, by theorem 22, 
that j € L'(X;p). (This result will be later generalized to complex-valued 
functions j). In the case of a general locally compact space, this condition 
should be replaced by 


sup | i(e)dula) Shee, 
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where K varies in the set of compact subsets of X. 

Exercise 1. Assuming that the functions f considered are “reasonable”, 
show that theorems 25 and 26 hold in every locally compact space, Polish or 
not [note that assumption (c) of theorem 22 holds]. 

Exercise 2. (a) Let X be a n-dimensional oriented manifold and w a 
differential form of degree n on X. Suppose that, in every local chart (U, vy) 
compatible with the orientation of X, there is a relation of the form 


6 


w = j(E)dE A... A dé”, 


where €' = v(x)’ and j(€) > 0. Show that the map 
fre f fu 


is a positive measure on X. (b) Show that every submanifold of X of dimen- 
sion < n has measure zero with respect to this measure. 


17 — The Lebesgue-Nikodym Theorem (LN) 


(i) Characterization of absolutely continuous measures. 


Theorem 27. Let \ and v be two positive measures on a locally compact 
Polish space X.°° There is a locally -integrable function j such that dv(x) = 
j(x)d\(x) if and only if every A-null set is v-null. 


Relation (16.2) shows that the condition is necessary. The converse re- 
quires some developments. 

(a) Let us first suppose that v < \ and that there is a compact set K 
such that v(X — K) =0. The first assumption shows that v*(f) < A*(f) for 
any function f > 0. Then, for all f € L(X), 


IW(P)| = |e (FX) SIA CPX) S N2 (Xe) Nol), 


where No is a A-norm. The map f +> v(f) thus extends to a continuous linear 
functional on the Hilbert space L?(X;). 

However, we know (n° 19, theorem 31) that on a Hilbert space H, the only 
continuous linear functionals are the inner products f +> (f|g) with g € H. 
Hence there is a function j € L?(X;A) such that 


v(f) =A(fj) for all f € L(X). 


This proves that dv = jd. We leave it to the reader to show that j(x) > 0 A- 
almost everywhere and that j(a) = 0 ae. outside K. This argument simplifies 


°8 Superfluous assumption provided “null” is replaced by “locally null” in the 
statement: A(N M kK) = 0 for any compact set K implies v(N MK) = 0. The 
proof of the general case, which was unknown before N. Bourbaki, uses transfinite 
induction. 
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Lebesgue’s proofs for the measure named after him and those of Nikodym 
(1930) for general abstract measures. It is due to von Neumann (1940). 

(b) We assume that v < X still holds, but we drop the assumption that 
the support of v is compact. For every compact set kK’, von N’s arguments can 
be applied to the measure dvx(x) = y,,(x)dv(x). Hence there is a function 
jx € L?(X;A) such that dvx = jxd\. One can obviously suppose that jx 
vanishes outside K. 

If K and H are two compact sets (theorem 25), then 


XHWK = XHX x -dY(£) = Xx dVH = Xx XH 
and hence yajxdA = x, judy. As a result (exercise !), 


XHIK =XxJH =XKoHjJ A almost everywhere. 


Let K,, be an increasing sequence of compact sets such that X =U Ky, up to 
a A-null set. Let x, (resp. jn) denote the function y,, (resp. jx) corresponding 
to K,. These functions vanish outside K,,. Then the previous relation shows 
that, for all n, there is a \-null set N, C K, such that 


jJnti(@) =jn(x) forall ae K,-—N,. 


If N =U Np, then there is a function j on X such that j = jn, on K,-K,NN 
for all n. Its value on the \-null set X —) K,, is immaterial. If K is a compact 
set, then j = jx = jn almost everywhere on KM K,,, hence almost everywhere 
on kK. Thus dv = jd. 

(c) In the general case, A and v are bounded above by the measure p = 
A+ v. Hence 


dX = pdp, dv = qdp 


with locally p-integrable functions p and q. Let us consider the sets S = 
{p(z) # 0} and T = {q(x) # 0}. We know from lemma 1 of the previous 
n° that X — S (resp. X — T) is a X- (resp. v-) null set. As \*(N) = 0 is 
assumed to imply v*(V) = 0, one can deduce that v*(X — S) = 0 and hence 
(n° 16, lemma 1) that p*[(X — S)MT] = 0. Replacing q(x)by 0 at each point 
x € (X — S)NMT, which preserves the relation dv = gdp, one may, therefore, 
assume that g = 0 in X — S, in other words that T CS. 

We then consider the function j(a) equal to q(x)/p(x) on S' and 0 else- 
where, whence gq = jp everywhere. Since jp is locally p-integrable, j is locally 
integrable with respect to pdp = dX and 


jd\ = j.-pdp = jp.dp = qd = dv 


[theorem 25, (16.3)], proving the Lebesgue-Nikodym theorem. 


(f) 
(A) 


(ii) Application to complex measures. When 0 < » < p, A*(f) 


<< 
clearly holds for any function f with values in [0,-++co] and so A*(A) < p* 
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for all A Cc X. In conclusion, dX(x) = j(x)du(x) with a function j obviously 
satisfying 0 < j(x) < 1. The next result will follow from this: 


Theorem 28. For every compler measure , there is a positive measure ju 
and a function j locally p-integrable such that 


dN(x) = j(«)dp(x) 


Let us set A = Ay + 7A2, where A; and A» are real and suppose there are 
positive measures such that 


Ay = AL - NN; Ag = AZ — NG. 
The measure 
w= A +A HASH AZ 


is positive and is an upper bounded for the four measures on the right hand 
side. Hence they are all of the form j(x)dy(x). Thus so is ». A far more 
elementary and frequently proclaimed result remains to be proved: 


Theorem 29. Every real Radon measure v on a locally compact X is the 
difference of two positive measures. 


It suffices to prove that a positive measure ’ > v exists. For such a 
measure, the relation 0 < h < f implies v(h) < A(h) < A(f). For all f € 
L(X), this leads one to write 


(17.1) v*(f)= sup v(h). 
O<h<f 
heL(X) 


Thus v(f) < vt(f) < A(f) for all f > 0 and every positive measure A > v. 
If v* is shown to be the restriction of a positive measure to L4(X), not only 
will the theorem have been proved but also that vt is the smallest positive 
measure bounding v above. 

First, 0 < vt(f) < +00 since functions h such that 0 < h < f vanish 
outside a fixed compact set and satisfy ||h|| < ||f||. Obviously, vt(af) = 
avt(f) for 0 < a < +oo. So is 


vi(ft+g)>ut(f)+v*(9) 


for f,g € Li (X) since h’ +h" < f+gifh’<f andh” <q. 

To prove the opposite inequality, let us consider some h € Dpg(X) such 
that O<h< f+g and set h’ =hf/(f +g), h” =hg/(f +g), by prescribing 
that h'(x) = h” (x) = 0 at all points where f + g vanishes. Let us show that 
h’ and h” have continuous compact supports. The first claim is clear. So is 
continuity at all points where f(z) + g(x) > 0. If f and g are zero at x, 
hence also h, continuity follows from the fact that h’ and h” are everywhere 
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contained between 0 and h, the latter being a continuous function vanishing 
at zx. 
This being so, h = h’ + h” with 0 < h’ < f and 0 < h” < g, whence 


v(h) =v (h') + (h") <v*(f) +7 (9). 


Taking the supremum of the left and side for the functions h considered gives 
vt(f +g) <v*(f) +v*(g), which leads to the expected equality: 


(17.2) v*(f+g) =ut(f)+¥"(9). 
We next define v* in Dg(X) by setting 
(17.3) vr(fyaut (f)-vF (f") 


if f = f’—f” with positive f’ and f”. As f’— f” = g'—g” implies f’+ 9” = 
g' +f" and hence, by (2), the same relation by applying v* to the functions 
considered, (3) defines v*(f) without any ambiguities and reduces to (1) for 
f > 0. As a result, vt is a positive result. As (1) shows that v(f) < vt(f) 
for all f > 0, the difference vt — v is also a positive measure, qed. 

Note that this proof applies to other cases. For example, L(X) can be 
replaced by the Banach space F£ of bounded continuous real functions on an 
arbitrary topological space and vy can be chosen to be a linear functional on 
E. 

To justify notations, it is necessary to show that 


+ 


v=vt—v 7 where y- =(-v)t. 


Now, by definition, 


(—v)*(f) = sup —v(h) = sup —[v(f)—v(f —h)] 


O<h<f O<h<f 
=-v(f)+ sup v(f —h) =—-v(f)+vT(f) 
O<h<f 


since ht f —h permutes the functions h considered, whence the result. 

Additional discussions on upper envelopes of a family of measures will be 
found in N. Bourbaki. 

Theorem 28 enables us to define the absolute value of a complex mea- 
sure A: we choose any measure ys > 0 such that dA(a) = j(x)du(x) and set 
d|A|(x) = |j(v)|du(a) . The result is independent of the choice of yu. Indeed, if 
dy = jd’ = j"du"~, writing w = yp’ + pw” gives du’ = y'dp, du” = p"du with 
vy’, yp" > 0. Then theorem 25 shows that 


sit ow 


dA=j.y'du=j'p'.du=j'y" dp. 
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Lyd Bre 


Thus j'y’ = jy" p-almost everywhere, and so also |j/|y’ = |j"|p” . Then 
du! = |i eldu = |i" ody = ("1 du" 


as announced. 
Then for any function f € L(X), 


(17.4) IAA) = le GA) s Hf) = IALUAD » 


or, using more traditional notation, 


| [rooe)|s fire@ane, 


a result that may seem as obvious as the triangle inequality! More generally, 
a function f will be said to be A-integrable if it is|A|-integrable, i.e. if jf is 
p-integrable, in which case, we obviously set A(f) = uGf). 

Choosing as above a measure ys > 0 such that dA = j.du, whence d|\| = 
\jldu, the function 


— iz) /li(@)| if (x) £0, 
ei if j(x)=0 


YK 


w(x) 


is w-measurable, and so locally integrable since it is bounded. As j(#) = 
w(x)|j(x)| for all x and as the right hand side is locally integrable, (16.3) 
applies and shows that 


(17.5) dX(x) = w(x)d|A| (x). 


Conversely if, for a measure 4 > 0, dA = wd with |w(x)| = 1 almost every- 
where, then clearly 4 = |AI. 


Finally, we state a formula, analogous to (1). It will come in useful in the 
next n° and will enable us to define |\| bypassing the LN theorem: for all 
fe L(x), 


(17.6) IA\(f) = sup |A(g)| - 
lalsf 
gEL(X) 


Setting |A| = w and dA = wd, the proof reduces to showing that 


(17.6) u(f) = sup |u(wg)] . 
lalsf 
gE L(X) 


First of all, |u(wg)| < pu(|wg|) < u(f) for all functions g considered. The 
left hand side of (6) is thus greater than the right hand side. To prove the 
opposite inequality, it suffices to show that there are functions g, such that 
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lim p(wgn) = u(f). However, there is an increasing sequence of compact sets 
Ky, contained in the support K of f, such that K — (J Ky, is a null set, and 
on each of them the function w is continuous and with absolute value 1. For 
all n, there is a continuous function h, on X®° equal to @ on Ky. Taking 
its composition with a continuous map from C to the disc D = {|z| < 1} 
equal to z # z on D and in particular on its boundary, one may suppose 
that |hy(x)| < 1 everywhere. The function gy, = hy» f, which is in D(X), then 
satisfies |g,| < f everywhere and wg, = f on K,. Thus, the decomposition 
of integrals on K into integrals on K,, and kK — Ky, gives 


lt won) — w(f)| < | lo() gn (2) — F(@)| dle). 


K-Kn 


As |wgn — f| < 2f everywhere, the right hand side tends to 0, proving (6). 

Exercise. (a) Show that A is bounded if and only if so is |A|. (b) Suppose 
that dA = jdu with respect to a reasonable measure pp > 0. Show that X is 
bounded if and only if j € L'(X;,), and then that 


al = / Li(x)| aux). 


(iii) Lebesgue decomposition. Let y and ps’ be two positive measures on 
X and let us consider the measure v = «+ pv’. The LN theorem enables us to 
write du = pdv, du’ = p'dv for some positive locally v-integrable functions p 
and p’ such that p+ p’ = 1 ae. We set 


q(x) = p'(x)/p(x) if p(x) 40, g(x) =0 if p(x) =0, 
q(x)=0 if p(x) 40, @(z)=p'(x) if p(x) =0. 


These functions are v-measurable. As p’! = pq+q’, 


du’ = pg.dv + q'dv = q.pdv + q'dv = qdut+qddv. 


°° ‘This follows from Urysohn’s theorem: if X is a normal topological space [i.e. such 
that, for any closed disjoint sets A,B C X, there exist open disjoint sets U > A 
and V > Bj, then any function with values in R (or R”) defined, continuous 
and bounded on a closed set A has a continuous extension to X (if X is locally 
compact without being normal, A must be assumed to be compact, which is 
the case here); N. Bourbaki, Topologie générale, Chap. IX, § 4.1. The case of 
metrizable spaces is dealt with in Dieudonné, IV.5: if f is defined on a closed 
subset A of X, the function g equal to f on A and defined outside A by 


d(x, A)g(x) = inf d(x, 9) f(y) 


is continuous and answers the question. The proof reduces to showing that g is 
continuous at all boundary points of A. 
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Setting dA = q‘dv, 
(17.2) dp! = qdu+ da 


follows. The sets S' = {p(a) 4 0}, S’ = {q'(x) 4 0} are disjoint by definition 
of q’. Since du = pdv, u(X —S) = 0 by lemma 1 of n° 16, hence also p(.S’) = 0; 
similarly \(X — S”) = 0 since dd = q'dv. 

Note that X being countable at infinity, any v-measurable function is 
equal ae. to a Borel function (observe that a continuous function on a compact 
subset and vanishing elsewhere is Borel, then use Lusin’s theorem). Hence one 
may assume p and p’ to be Borel, and thus also g and q’, in which case S” is 
a Borel set. 

Relation (7) therefore gives a decomposition of yw’ into the sum of two 
measures, one absolutely continuous with respect to 4 and the other concen- 
trated (obvious definition) on a Borel and p-null set S’. This is the Lebesgue 
decomposition of ju’ with respect to py. It is unique (exercise !). 

Two positive measures ys and yi’ are said to be disjoint when they are 
concentrated on disjoint sets. Setting v = 4+ py’ and du = pdv, dy’ = p'dv 
as above, this obviously means that 


inf [p(x), p’(x)] =0 almost everywhere with respect to v. 


As any measure 4 > 0 bounded above by pw and yp’ is of the form gdyv with 
q <p,p’, this means that 0 is the only measure bounded above by w and p’. 


The notion of disjoint measures can be interpreted in a completely differ- 
ent manner. Let us consider finitely-many n positive measures 44; and their 
sum v. Thus du; = pidv with functions satisfying )> p;(x) = 1,0 < p(x) <1 
almost everywhere with respect to v. We consider the L? spaces of these 
measures. By theorem 25, which is nothing special in this case, any square 
integrable (resp. null) function f with respect to vy is also of this type with 
respect to any of the measures 4; and conversely. Hence one can associate 
classes f; € L?(X;,;) to each class f € L?(X;v). This gives an injective 
linear map 


(17.8) L?(X;v) — L? (X34m) x... x L? (X35 pn) 


compatible with the Hilbert structures of these spaces since 


(Fl) = f Flo\ate)av(e) => f e\gT@dn(a) = (Fila 


for all f,g € L?(X;v). This said, let us show that map (8) is bijective if and 
only if the measures (4; are pairwise disjoint, i.e. concentrated on pairwise 
disjoint sets S;. 

If this condition holds, taking for all 7, a square integrable function f; 
with respect to 4, it may be assumed to vanish outside S;. The function 
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f =>¢ fi is equal p;-ae. to f; for all i since y;(5;) = 0 for 7 4 i. Thus (8) is 
surjective. Conversely, if (8) is surjective, then, for all f; € L?(X;j;), there 
exists f € L?(X; 1) such that 


f(x) = fi(~) js; — almost everywhere. 


For a given 7, let us choose f; = 0 for all j Ai and f; = x,, where K is 
a compact set. Setting N = {f(x) # 0}, uj(N) = 0 and so p;(K MN) =0 
for all 7 4 i. Since f(x) = 1 almost everywhere on K with respect to u;, 
i(k — KON) = 0. The application of this construction to an increasing 
sequence of compact sets whose union is X shows that the union N of the 
Np corresponding to the Kp satisfies 


uj(N)=0 for all 7 Ai,ui(X —N)=0. 


Hence p; and py; are disjoint for i ¢ 7, qed. 
This result can be partly generalized to “continuous sums’ 


v= f add) 


of n° 12 and 13, by associating to each class f € L?(Z;v) = H the corre- 
sponding classes f, € L?(Z; fx) = He. Theorem 22, (13.13), shows that 


(17.9) (Fla) = | (Folge) ante) 


for all f,g € H, which suggests a continuous analogue of a Hilbert direct sum. 
n° 23, (v) will clarify this vague notion encountered in many fields, mostly 
under a form related to Leibniz ’s idea of an integral. But von Neumann has 
passed this way, even if many people seem or prefer to ignore it. 


') 


18 — Continuous Linear Functionals on L?. The L° Space 


Let X be a locally compact space and yz a positive measure on X. For all 
p > 1, Holder’s inequality shows that, if j ¢ LY, the map 


f+ w(f5) 


is a continuous linear functional on the L? Banach space . The same is true 
for p = 1 if j is chosen to be a bounded measurable function. We intend to 
show that all continuous linear functionals on L? are obtained in this way. 
So let v be a continuous linear functional on L? or, equivalently, since 
L(X) is everywhere dense in L?(X; 4), on the space L(X) equipped with the 
LD? norm. 
Since continuity reduces to the existence of a constant M such that 


1/p 
(18.1) PAL < Mit = a | f er ano) 
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for all f € L(X), for any compact set K Cc X, 
WAI < Mu(K)P||f|| if fe LX), 


where || f|| is the norm of uniform convergence. As a result, v is a complex 
measure on X. Assuming M = 1 for simplicity’s sake and writing (1) as 
\v(f)|? < w(|fl?), formula (17.6) defining |v| shows that 

\? 


Ivi(f)” = sup|v(g)|" < supp (|gl”) , 


whence 


(18.2) wif) < wf)? = |IFllp 


for all f € L4(X). Hence v may be assumed to be positive. 

Then let y be a positive lsc function. If @ is the increasing philtre of 
f € L4(X) bounded above by y, the set of f? where f € @ is an increasing 
philtre whose upper envelope is y?. Taking suprema in (2), (2) follows for y. 
Hence, if f is now an arbitrary function with values in [0,-+oo], then 


*(f)? = inf P< inf Pp). 
v*(f) ant vie) nb ee) 


However, the strictly increasing functions t > t? and t + t!/? transform 
every Isc function into an Isc function. The functions y? are, therefore, all Isc 
and upper bounds for f?. In conclusion, 


U*(f)P Sw (FP) 


for any function f > 0. In particular any p-null set is v-null. Thus dv = jdu 
for a locally p-integrable function j > 0. So v(f) = u(jf) for all f € L4(X) 
and v*(f) = u*(jf) for all f > 0 by theorem 25. 

But if j satisfies 


(18.3) wf)? <u" (fF?) for all f > 0, 


so does every positive, locally integrable function and < j, in particular j, = 
inf(j,n) for all n € N and so perforce jn.« = jnX, =J for any compact set 
k. We next need to distinguish between two cases. 

If p > 1, there exists g > 1 such that 1/p+1/q = 1, which can also be 
written p(q—1) = q, whence (j/%~ 1)? = j'7. This function is -integrable since 
it is measurable, bounded and zero outside a compact set (n° 9, theorem 18). 
Replacing j by j/ and f by j/7~! in (8) gives 


we it)? <u G4) 


and as a consequence, p*(j/7) < 1. As this holds for all n, (theorem 8) jy, = 
sup, Jn,K is in L4(X; 2) and ||jx,,||¢ <1 for all K. Hence, if p is reasonable, 
then j is in L4(X; 1) and satisfies ||jl|, < 1. 


116 XI — Integration and Fourier Transform 


If p = 1, relation (3) means that the measure ps — jdy is positive, and so 
j(z) < 1 almost everywhere. 

Hence, coming back to a complex measure v, relation (1) shows that 
dv = jdw where the locally integrable (complex) function j satisfies ||j||, <M 
if p > 1, and |j(x)| < M almost everywhere p = 1. By Hélder’s inequality,the 
expression ji(fj) is then well-defined for all f € ZL? and is a continuous 
linear functional on L?. Since it coincides with the given linear function on 
the everywhere dense subspace L(X), relation v(f) = (fj) is true for all 
f € L*. In conclusion, and assuming pz to be reasonable: 


Theorem 30. For every continuous linear functional v on the Banach space 
L?(X;) with 1 < p < +00 (resp. p = 1), there is a function j € L9(X; 1) 
(resp. bounded) such that 


(18.4) u(t) = w(F5) 


for all f © L?(X;p). |v(f)| < M||f\lp for all f © L°(X;p) if and only if 
llillq < M (resp. |j(x)| < M almost everywhere). 


This shows that for p > 1, the norm of the given linear functional v on 
L? is 


(18.5) llilla = sup |o(f)| /Ilfllp - 


For p = 1, L4 needs to be replaced by the space L®(X; y1) of classes of bounded 
measurable functions with respect to 4. To get a relation similar to (5) in 
this case, denote by ||j||.o the smallest number M such that |j(x)| < M ae. 
Its existence is immediate since, if M is the infimum of the set of M’ such 
that |j(a)| <M’ ae., then |j(a)| < +1/n ae. for all n and so |j(x)| < M ae. 

Theorem 30 says that the dual” of L? is L4 for 1 < p < +00. But it 
should not be thought that the dual of L® is L'. The arguments used to 
prove theorem 29 fall apart in this case for the simple raison that L(X) is 
not everywhere dense in L®. Hence by the general Hahn-Banach theorem,”! 
there are continuous linear functionals 4 0 on L® that vanish on L(X). Such 
a form is obviously not defined by a measure on X. This is already the case 
if ys is the measure pu(f) = >> f(x) on N though, in this apparently trivial 
case, it is impossible to explicitly construct a continuous linear functional on 
L® (the space of bounded sequences) not defined by some j € L?. For this, 


70 The topological dual E’ (not to be confused with the algebraic dual E*) of a 
Banach space EF is the space of continuous linear functionals on E. 

71 In other words: if Ho is a closed vector subspace of a Banach space H, there is 
a family (u;) of continuous linear functionals on H such that 


x Ee Ho ui(x) =0 for all z. 


Dieudonné (XII.15, exercises 3 and 4) proves a more general result by assuming 
the space is separable, which is the case of L° only if « reduces to a finite sum 
of Dirac measures. 
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“ultrafilters” from N. Bourbaki’s Topologie générale or similar procedures 
founded on transfinite induction need to be used. This does not prevent 
the space L°, a space that should be handled prudently, from playing as 
important a role as L! and L? in analysis. 


Corollary. A locally -integrable function j is in L?(X;) for some p € 
[1, +00] if and only if there is a constant M such that 


IG f)|<M.Ng(f) for all f € L(X). 
Then |lj||p < M. 


Indeed, the map f + u(jf) is a linear functional on L(G) which, being 
continuous with respect to the norm N,, can be extended to L4 if q¢ < +00. 
This gives a j’ € L? such that wGf) = u(j/f) for all f € L(G), proving the 
corollary in this case. If g = +00, i.e. if p = 1, in which case Ng(f) = ||flloo < 
| fl, where this is the uniform norm of f, then the measure f +> p(jf) is 
bounded and the norm < M. Thus Ni(j) < /, qed. 

Exercise. For any function yp € L™®(X;,4), set M(y)f = vf for all f € 
L?(p < +00). Show that all continuous operators on L? commuting with the 
M(y) are of the same type. 
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§ 6. Spectral Decomposition on a Hilbert Space 


19 — Operators on a Hilbert Space 


(i) Definitions, continuous linear functionals. Recall that a Hilbert space H 
is a complex vector space equipped with an “inner product” (aly) satisfying 
the following conditions: 


(H 1) the function x + (2|y) is linear for all y, 


H2) (aly) = (y|a) for all x and y, 

H 3’) (aa) is always > 0, 

H 3”) (a|x) =0 implies x = 0, 

H4) H is complete with respect to the distance 
d(x,y) = (x —yla—y)'? = lle —all- 


In practice, a complex vector space H is often equipped with a positive Her- 
mitian form, ic. amap B:H x H — C satisfying conditions (H 1), (H 2) 
and (H 3’), but not the following ones. The Cauchy-Schwarz inequality 


|B(x,y)|" < B(x, x)Bly,y) 


then shows that the set N of x such that B(z,x) = 0 is a vector subspace 
of H and that B(x, y) only depends on the classes of x and ymodWN. Hence 
denoting by x ++ xp the map associating to each zx its class rp € H/N 
provides the quotient space with a positive Hermitian form satisfying (H 3”) 
as well as with a distance. To get a proper Hilbert space, it remains to replace 
H/N by its completion with respect to this distance. By abuse of language 
it will be called the completion of H with respect to B. The spaces L?(X; 1) 
of integration theory are obtained in this way: one endows L(X) with the 
Hermitian form 


B(f.9) = f FleaCebdy(e) 


and then one proceeds to its completion. Here, proceeding to the quotient 
corresponds to going from functions to classes of functions defined up to null 
sets. 

In a complex vector space, every Hermitian form B(x,y) satisfies the 
identity 


(19.1) AB(z, y) = Bla t+y,2+y)-Ba-—y,r-y) 
+B(x + iy, x + ty) — Bla — ty, x — ty) 


following from axioms (AH 1) and (AH 2). It is frequently used. For a Hilbert 
inner product, it is written 


(19.1’) 4 (aly) = |le + yll? — [lx — yl? + lle + tll? — lle — tyll?. 
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The Cauchy-Schwarz inequality shows that, for all a © H, the map tH 
(z|a) is a continuous linear functional on H. Conversely: 


Theorem 31 (M. Fréchet, 1907). For any continuous linear functional 
f on H, there is a unique a € H such that f(x) = (ala) for alla € H. 


One may assume that f 4 0. The subspace € defined by f(a) = 0 is then 
closed and distinct from 1. Let us suppose that there exists b 4 0 orthogonal 
to €; the relation f(x) = 0 then implies (|b) = 0, whence f(b) 4 0. So one 
may assume that f(b) = 1. Then 


and thus « — f(x)b € €. So (a|b) = (f(x)b|b) = f(x)(b|b). The vector a = 
b/(b|b) answers the question. 

Hence the proof reduces to showing the existence of b. This is a conse- 
quence of lemma 2 below. 


Lemma 1. Let C' be a closed conver set in H. For alla € H, there is a 
unique a’ € C such that 


(19.2) |la — all < |la’ — all 


for all x € C other than a’. 


Indeed, let m be the distance from a to C' and let us choose elements 
ay, € C such that lim ||a, —a|| =m. Applying to a— x and a—y, the identity 


(19.3) lle + yll? + le — ll? = 2 (ell? + Ilyll?) 5 


which follows from formal properties of the inner product, shows that 


IIx — yll?/4 = 5 (lla — a]? + lla — yll?) — lla - 5( +y)I) 


1 
2 


for all x and y. For « = ap and y = ag, $(x + y) € C, so that the last term 
is > m. Thus 


la — ag? /4< 5 (lla— apll? + lla — agll?) — mr, 


an arbitrarily small result for large p and gq. Therefore, (a,,) converges to some 
a’ € C obviously satisfying ||a — a’|| = m. If a” € C is, like a’, the minimum 
distance from a, then by (3) 


1 
2 


ja’ — al"? /4 = (lla - a? + la— aI") - lla - 5 (a +0") |. 


The first term on the right hand side is equal to m and the second term is 
>m. The left hand side is thus < 0, ged. 
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Lemma 2. Let € be a closed vector subspace of H. Then H is the direct sum 
of E and of the orthogonal complement of €. 


For all a € H, let a’ be the unique point of € at a minimum distance from 
a. For alla € € and all A €C, 


lla — (a + Aw) ||? = la — a’ ||? + 2Re [A (a —a'|z)] + AP |||? > la — a’? 
and so 


|Al? ||x||? + 2Re [A (a — a’|x)] > 0 


for all A € C. Thus (a — a’|x) = 0 for all 2 € €, which proves that a —a’ is 
orthogonal to €. The sum of € and its orthogonal complement is direct since 
(a|x) = 0 for all common vectors, qed. 

Denoting by €+ the orthogonal complement of €, a corollary of lemma. 2 
says that conversely 


E=(Et)~ 


since H =E@ Et = E+ @ (E+)+; however, € c (E+)+. 


Theorem 31 also has a very important corollary: 


Corollary. Let H and H’' be two Hilbert spaces, D and D’ subspaces every 
dense in H and H’, and B: Dx D! —> C a sesquilinear form’. There is a 
continuous linear map A:H—> H' such that 


B(z,y)=(Az|y) forw~eD, yeD’ 
if and only there is constant M such that 
\B(e,y)| < Mlleallilyl| fore D, yer’. 
Necessity is obvious. Conversely, the stated condition shows that, for all x € 
D, the linear functional y 4 B(x,y) is continuous on D’, and so can be 


extended to H’. Thus B(aly) = (a2’|y) for some unique x’ € H’ obviously 
depending linearly on x. Setting x’ = Az, 


|| Axl] = sup |(x'y)| /llyll = sup|B(a,y)| /Ilyll < Mile), 


qed. 

Throughout this chapter, the notation £(H) will denote the set of con- 
tinuous linear maps (“operators”) from H to H. It is a Banach space with 
respect to the norm”? 


"2 This means that the partial maps «+> B(x,y) and y+> B(a,y) are linear. 
3 Tt should be pointed out that the vector x = 0 is excluded from the relation. I 
prefer relying on the reader’s common sense. 
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|| Al| = sup || Aa|/la|]. 


(ii) Orthonormal bases. A family (e;) of vectors in a Hilbert space 1 is 
said to be orthonormal if (e;|e;) = 0 or 1 according to whether 7 and j are 
distinct or equal. Every Hilbert space has orthonormal bases. This is how one 
calls an orthonormal family (e;);¢7 such that 0 is the only vector orthogonal 
to all e;; in other words, any maximal orthonormal family. Lemma 2 shows 
that the closed subspace generated by the vectors e; is H. For a finite subset 
F of I, letting Hp denote the subspace generated by the vectors e; (i € F), 
and for all « € H, setting €; = (z|e;), the orthogonal projection zp of x onto 
Hp is clearly the partial sum of the series }> &;e; extended to F'. Thus 


ole? = lel? - eel? < llel?- 

icF 
The set of 7 such that €; 4 0 is, therefore, countable. Moreover, as the finite 
linear combinations of the vectors e; are every dense in H, enabling us to 
approximate «x by the points x (points in Hp at a minimum distance from 
x), taking the limit, 


le? = So 1&P? = SP I(ales)? 


and 


where the limit is extended to all finite subsets F of J, as in the case of 
unconditional convergence. A learned way of putting this would be to say 
that H is isomorphic (obvious definition) to the space L?(I;.) where I is 
equipped with the discrete topology and where the measure yu assigns the 
weight 1 to each point of J. 

The previous formula does not mean that the series 57> &e; converges 
absolutely to x: for this to be the case, it is necessary that S> |&;| < +00. 

Exercise 1. Suppose H = L?(X;) where X and yp are arbitrary. Let 
K(a,y) be a square integrable function with respect to the product measure. 
(a) Show that there is continuous linear operator Ax on H such that 


(Ax fla) = [f Ke) Food (e)auty) 
for all f,g © H and that 
Agfa) = [ K(e.w)Fluduly 
for all f € H, the integral being convergent for almost all x. (b) Let (e;) 


be an orthonormal basis for H. Show that the functions e;(x)e;(y) form an 
orthonormal basis for L?(X x X;, x 2). (c) Set 
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AKe; = ) Aijesz 


with aj; € C. Show that >); ; |ai;|? < +oo. Converse? 

In the general case, the existence of orthonormal bases can only be proved 
by using transfinite inductions (Zorn’s theorem). But when H is separable, 
some may be constructed by applying the Schmidt orthonormalization pro- 
cess: choose linearly independent vectors a, whose linear combinations are 
everywhere dense in and change them as follows: replace a; by aj/|lai||, 
subtract from az its projection on the subspace generated by a, and divide 
the result by its norm. subtract from a3 its projection on the subspace gen- 
erated by a; and az and divide the result by its norm, etc. On could write 
down explicit formulae; they can be useful at times: 

Exercise 2. Since any continuous function on X = [0,1] is the uniform 
limit of polynomials, these are everywhere dense in the space H = L?(X) 
of Lebesgue measure. Hence an orthonormal basis for #1 can be obtained by 
orthonormalizing the sequence of functions 1, t, t?,... Calculate the functions 
obtained. 

Exercise 3 (von Neumann). Let X be a locally compact space, p41 a posi- 
tive measure on X, H a separable Hilbert space and a,(x) a sequence of 
measurable maps from X to H. Suppose that, for almost all xz, the closed 
subspace of H generated by the maps a,(~) is the all of H. Show that there 
are measurable maps e,(2) from X to H such that, for almost all x, the 
family of maps e,,(2) is an orthonormal basis of H. 


(iii) Adjoints, Hermitian operators. Now consider a continuous linear op- 
erator A in H, or more generally a continuous linear map from H to another 
space H’. The form B(x, y) = (a|Ay) is accountable for the corollary of the- 
orem 31, with M = ||A||. This gives a continuous operator A* : H’ —> H, 
the adjoint of A, characterized by the identity 


(19.4) (Azly) = (2|A*y) . 
We then deduce that 
(19.5) (A+ B)* = A*+B*, (AB)*=B*A*, (A*)*=A 


as in finite dimension. Since it has been proved above that ||A*|| < || Al], the 
last equality shows that actually 


(19.6) |A*|| = |All. 

The sequence also satisfies the essential property 

(19.7) ||A* Al] = |All’. 

“4 The terms “operator” and “map” are synonymous. The former, in use since 


long before the latter and still very widespread, is almost exclusively employed 
to denote a linear map from a Banach space H (or a subspace of it) to itself. 
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To see this, it suffices to show that the left hand side is greater than the right 
hand one. Now, for all « € H, 


(A* Aa|a) = (Aa|Ax) = ||Aarl|?, 
whence 


|| All? = sup || Az||?/||x||? = sup (A* Aa) /||2||? < ||A*All , 


qed. 

In this chapter, we will repeatedly make use of self-adjoint operator al- 
gebras. These are sets A of operators that are vector subspaces and subrings 
of £(H) and such that 


TEA=>T* CA. 


Lemma 3. Let E be a closed subspace of H and A a self-adjoint operator 
algebra on H. The following properties are then equivalent: 

(i) € is A-invariant;” 

(ii) the operator P of orthogonal projection onto E commutes with all 
TeA. 
The orthogonal complement of E is then A-invariant. 


(i) = > (ii): if x is orthogonal to €, then 0 = (a2|Ty) = (T* aly) for all 
y € € and all T € A, so that the orthogonal complement of € is A-invariant. 

(ii) ==> (i): the elements x € € are characterized by the relation Px = x, 
which implies that Ta = TPx = PTx. Hence Tx € € for all T € A, qed. 

(Continuous) operators H such that H* = H are said to be Hermitian. 
They are characterized by the fact that 


(19.8) (Ha|x)€R forall xeH 


or that (Hay) is a Hermitian form on H. If a closed vector subspace € of H 

is H-invariant, so is its orthogonal complement (lemma 3). Since (Haly) = 

(a|Hy), the orthogonal complement of Im(H) is Ker(H) and that of Ker(H) 

is the closure (which is not generally closed) of the subspace Im(#). 
Positive Hermitian are defined by the condition that 


(19.9) (Ha\x)>0 forall eH. 


This is the case of H = A*A for all A € L(H). We write H < H’ when 
H' — His positive, i.e. if (Ha|x) < (A’a|x) for all x. 


> This means that every T € A maps E€ to €. 
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If H > 0, (Hzly) is a positive Hermitian form satisfying the Schwarz 
inequality 


(19.10) |(Haly)|? < (Ha|x) (Hyly) . 
It follows that 
(19.10’) || H || = sup |(H2|x)| / (2|x) - 


Hence ||H|| < ||H’|| if0< H < H’. 
Relation (10) also shows that 
(19.10”) Hx =0 <> (Azz) =0. 


It follows that Ker(H’+ H”) = Ker(H’)NKer(H”) if H’ and H” are positive. 

If € is a closed vector subspace, the operator Ps, which associates to all 
x €H its orthogonal projection on €, is Hermitian positive since, « — Pex 
being orthogonal to Pex, 


(Pex|x) = (Pex|Pex) > 0. 
The relation 
(19.11) Pra PSP 
is easily seen to characterize these operators, the corresponding subspace & 


being the image of # under P. 


(iv) Spectrum of a Hermitian operator. In finite dimension, hermitian 
operators H are known to be diagonalizable. There is even an orthonormal 
basis whose elements are the eigenvectors of H, the corresponding eigenvalues 
being real. In infinite dimension, the situation is not so simple, but results 
heading in that direction can be easily obtained. 


Lemma 4. Let H be a Hermitian operator on a Hilbert space H. Then H—¢ 
is invertible for all ¢ ¢ R. 


If¢ =a+if with 6 £0, then H—¢ = B[(H-—a)/B—71|. As (H—a)/B8 is 
again Hermitian, it suffices to prove the lemma for ¢ = 7. First of all, H —7 
is injective since the imaginary part of 

((H — i)a|x) = (Hale) — ila? 
is ||x||?. To show that H — 7 is surjective, start with the relation 


||(2 — aal|? = Ae? + [lal]? > lle? 


If, for a sequence (x,), the sequence with general term y, = (H — i)ay, 
converges, and so satisfies Cauchy’s criterion, so does (x,,). It follows that 
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H —i maps H onto a closed subspace H’ of H. Any y € H orthogonal to 
H' must satisfy ((H — i)x|y) = 0 for all x, and so (Hay) = i(aly). Hence 
(Hyly) = i(yly), which gives y = 0 since the left hand side is real. Finally, 
the inequality used above tells us that H — i is bijective and its inverse is 
continuous, qed. 

Lemma 4 enables us to associate the Cayley transform 


U=(H-1)\(H +2)" 
to H. Then UH = HU and, by (5), 
U*U =UU* = 1. 


This relation, which defines unitary operators and means that «+> Uz, is an 
isomorphism from H onto H: U is bijective and preserves the inner product. 
The definition of U from H shows that H(1—U) = i(14+U). To deduce that 


(19.12) H=i1+U)1-U)"', 


one needs to show that 1 —U is invertible. Now, (1 — U)a = 0 implies that 
(1+ U)x = 0. Thus Ux = 0 and so x = 0. On the other hand, the equation 
a = (1—U)z, where a is given, is equivalent to 


a=a2—-(H-i)(H+i)""'2, 


i.e. to (H +i)a = 2ix and so always has a solution. As a result, 1 — U is 
bijective, which justifies the formula. As 


(1-U)* =2i(8 +3)7+, 


the inverse of 1 — U is continuous. 
Conversely, (12) defines a Hermitian operator for every unitary operator 
U such that 1—U is invertible. See n° 23, (ii), exercise 4. 


For a continuous operator A on H, the set of ¢ € C for which A —¢ is not 
invertible is called the spectrum of A, and written Sp(A). We will see in the 
next n° that it is a non-empty compact set. For a Hermitian operator H, the 
infimum and supremum can be determined by introducing the numbers 


(19.13) my = inf (Haz) /(2|z), My = sup (H2|z) / (2|x) 
contained between —||H|| and +||H||. So saying that H is positive mean that 


Lemma 5. The spectrum of a Hermitian operator H is contained in the 
interval [mx, My] and contains its endpoints. 


Let us suppose that for example ¢ < my and replace H by H — ¢, which 
subtracts ¢ from my and My. For the new operator, my > 0 and the proof 
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reduces to deducing that H is invertible. As (Ha|x) > my(a|x), H is injective 
and my||x|| < ||Ha||, so that the image of H under H is closed. A vector y 
orthogonal to it must satisfy (Ha|y) = 0 for all x, and so (2|Hy) = 0, whence 
Hy = 0. Thus y = 0 since H is injective. The image of H is, therefore, (lemma 
2) equal to H. For ¢ > My, apply arguments to the operator ¢ — H. 

It remains to show that H — ¢ is not invertible for ¢ = mg or else that 
if a positive Hermitian operator H is invertible, then my > 0. Setting m = 
|| HI], by (10), 


m = inf ||Ha||/||x|| = inf |(Haly)| /||x|]-[yl], 
whence 


m? = inf |(Haly)[? /[lel? yl? 
< inf (Hale) (Hyly) /lllall2llall? 
= inf (Hale) /|lel!?. inf (Hyly) /liyll? = m3y- 


Thus my > m > 0 (and in fact equality holds), qed. 

We will show later that, short of being diagonalizable as in finite dimen- 
sion, any Hermitian operator H has the following property: there are closed 
H-invariant subspaces H’ other than 0 and H. For all r > 0, the spectral 
theory of n° 22 even enables us to decompose H into a direct sum of finitely 
many pairwise orthogonal, closed H-invariant subspaces H;, and such that 
for each of them there is a scalar 4; € R for which 


||Ha —A42|| <rlla|| for all « € H;. 


Thus the operator H satisfies ||H — ;|| <r on each H;. This is an approxi- 
mative form of diagonalization, precisions about which we will given in n° 24 
by constructing “virtual eigenspaces” that are not contained in H. 


(v) Weak topology. Recall (Chap. III, Appendix) that the seminorm on a 
complex vector space H is a function p(x) > 0 satisfying identities 


p(ax) = |a|p(z), plet+y) < p(x) + ply). 


Any family (p;); € I of seminorms enables us to define a topology on H. 
which is compatible with its vector structure and with respect to which the 
functions p; become continuous: any set defined by finitely many inequalities 


(19.14) lpi(x) — aul] <i, 


where r; > 0 and a; € C are given, being necessarily open, it suffices to define 
the open subsets of H to be the finite and infinite unions of sets (14). That 
the axioms hold is readily verified. In fact one only does the bare minimum 
to make the functions p; continuous. The Hausdorff axiom holds if and only 
if 0 is the only vector for which p;(#) = 0 for all 4. 
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The topology on a Banach space is obtained through this process by using 
the seminorm p(x) = |||]. But there is another way of obtaining it, consisting 
of choosing functions 


rr |f(a)], 


where f varies in the set H’ (topological dual of H) of continuous linear 
functionals on H. This gives the weak topology on H, defined by the norm 
which most authors call strong topology . Since |f(x)| < M||a|| for all f € H’, 
all open (resp. closed) subsets F of H with respect to the weak topology are 
clearly of the same type with respect to the strong topology. The converse 
holds if E is a vector subspace for if E is (strongly) closed, it is defined 
by a family (in general infinite) of equations f;(a) = 0, where the f; are 
continuous linear functionals (a consequence of the Hahn-Banach theorem). 
It is, therefore, also weakly closed. In the case of a Hilbert space, lemma 2 
enables one to avoid (or prove) Hahn-Banach. 

In this case, H' = H by theorem 31. Thus the weak topology is obtained 
by doing the bare minimum to make the function 


continuous for all u € H. Hence saying (for example) that a sequence x, € H. 
converges weakly to some x € H means that lim(x,,|w) = (a|u) for all w. 

Choosing an orthonormal basis (en), > |(en|2)|? < +00 and so lim(e,,|”) = 
0. Thus e,, converges weakly to 0, but not strongly since ||e,|| = 1 for all n. 
The identity 


Izu — ||” = llen||’ — 2Re(enlz) + ll, 


shows that a sequence x» converging weakly to a limit x converges strongly 
to x if and only if lim ||z,|| = ||2||. 

There are some useful consequences of Baire’s theorem [n° 15, (ii), note 
53] in this field. These are only particular cases of results holding for far 
more general topological vector spaces, the Fréchet spaces, but which can 
essentially be proved in the same way; see N. Bourbaki’s book on the topic. 
In all cases, countability and Baire’s theorem play a crucial role. 


Lemma 6. Let (a,,) be a sequence of vectors such that lim(an|y) exists for 
ally. Then sup ||an|| <-+oo and the sequence (ay) converges weakly. 


Setting f,(a) = (an|x) and f(a) = lim f,(x), the proof reduces to showing 
that the linear functional f is continuous, and to obtain this, that the norms 
of f, (i.e. of a,) are bounded. For all N, let us consider the set Fy of x 
for which |f,(a) — fg(x)| <1 for all p,q > N. Cauchy’s criterion shows that 
H =U Fw. However, the sets Fy are closed since the fo — fg are continuous. 
Thus one of the Fy (Baire’s theorem) has an interior point a. So there exists 
r > 0 such that 
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lal <r = |fp(a + 4) — fale + @)| <1 


for all p,q > N. As |fp(a) — fa(a)| < 1, it follows that ||f, — fql| < 2/r for 
all p,q > N, whence sup ||f,,|| < ++oo. Therefore, the linear functional f is 
continuous, and so there exists a € H such that lim(a,|x) = (alx), qed. 


Lemma 7. Let (A,,) be a sequence of continuous linear operators such that 
lim(A,2|y) exists for alla, y. Then sup ||A,|| < +00 and there is a continuous 
linear operator A such that 


lim (A,rly) = (Aaly) - 


The corollary of theorem 31 tells us that it is sufficient to prove the first 
proposition. By the previous lemma, we already know that sup ||A,,2|| < +00 
for all a € H, which is sufficient to prove the lemma [n° 15, (ii), note 53). 


Lemma 8. Let H, < Ho < ... be an increasing sequence of Hermitian 
operators. Suppose that sup(H,2|x) < +00 for all a € H. Then there is a 
continuous linear operator H such that lim ||Hx — Hy2\| =0 for alla eH. 


As Hyg = Hy — Hp = 0 for q > p, 


|(Apaely)|? < (Apgt|r) (Apayly) - 


Hence like the sequences (H,x|x) and (H,yly), (Hyxly) satisfies Cauchy’ 
criterion, which shows the existence of some continuous H such that (Hz|y) = 
lim(H;,x|y). Obviously, H > H,, for all n, whence 


|(Ha — Hyaly))? < (Hx — Hpalx) (Hy — Anyly) 
< ||H — A,,|| .|lyl|. (Ha — Hn2|x) 


et || Hx — H,,2\|? < ||H — H,,||(H2 — H,,x|x). However, sup ||H — H,,|| < +00 
by lemma 7. 


(vi) Hilbert-Schmidt operators. In finite dimensional, any operator A has a 
trace Tr(A): this is the sum of its diagonal entries with respect to an arbitrary 
basis of the space H considered. The positive Hermitian form 


= S- aigdiz 


(obvious notation) transforms C(H) into a new Hilbert space. To make the 
symbol Tr(A* A) well-defined when A is a continuous operator on an infinite- 
dimensional Hilbert space H, the most naive idea is to choose an orthonormal 
basis (e;) of H and to set, as in finite dimension, 


Tr(A* A) =) * (A*Ae;|e:) = |] Aeil|? 
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At the very least, this supposes that 
(19.15) S| ||Aeill? < +00 


and that the the sum is independent of the chosen basis. However, for any 
orthonormal basis (e%), 


> [Ae = 53 | (Aere§) |? = 5 |(e:]4*e4) |? 
i ij 43 
= 30 |4°ej|/’ . 
j 


Hence if (15) holds for a particular basis, then > || A*e’|/? < +00 in all bases. 
So applying arguments to A*, we see that S> || Ae’! ||? < +00 in all bases, qed. 

Condition (15) defines the Hilbert-Schmidt operators (HS), a very impor- 
tant class of operators in most analysis problems and which includes all finite 
rank operators, i.e. for which the dimension of Im(A) is finite. Exercise 1 of 
section (ii) explains their historical origin. 

The sum of two HS operators is HS thanks to the Cauchy-Schwarz inequal- 
ity applied to the “square integrable” sequences ||Ae;|| and || Be,|| (Chap. II, 
n° 15, corollary of theorem 7). The set £L2(H) of these operators is, there- 
fore, a vector subspace of £(H). If A is HS, clearly so is A* as well. For all 
P€L(H), ||PAe;|| < || P]|.|| Ae:||, so that P-A is a HS operator, and hence so 
is A* P* as well. Replacing A by A*, in conclusion, if A is HS, then PAQ is 
HS for all P and Q. In other words, £2(H) is a two-sided ideal of £(H). 

It is useful to observe that, for every family of vectors (a;) such that 
> |la;|]?_ < +00, there is a unique operator A € Lo(H) such that a; = Ae; 
for all 2: it suffices to set 


Ar = S- Eas for x = Ne Ge 2 
tel wel 


Then 


Sled lla < (Soe)? (Seat?) ””, 


so that the series defining Az converges absolutely. Moreover, 
||Aal] < $0 [ésasl| < Mlar| 


where M = (37 |\a;||?)!/?. A is clearly a HS operator. 

If A and B are HS, then Tr(B* A) = 5>(Ae;|Be;) is well-defined thanks 
to Cauchy-Schwarz. It does not depend on the choice of the basis: this was 
shown for B = A, and the general case can be deduced by using identity 
(1). It remains invariant if A and B are replaced by U~!AU and U~!BU, 
where U is unitary: this amounts to changing orthonormal bases. The same 
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calculations using the matrices of A and B with A and B with respect to an 
orthonormal basis show that 


Tr(AB) = Tr( BA) 


like in finite dimension. Tr(B* A) is an inner product on the space £2(H). 
The corresponding norm is written 


Alla = Tr(A*A)"? = (So llAedl?) 


£4(H) is complete with respect to this norm, and so is a Hilbert space. Indeed, 
associating the matrix a;; = (Ae;|e;) to each A € L(H) gives ||Ae;||? = 
>> |aij|?. The space of Hilbert-Schmidt operators equipped with its inner 
product is, therefore, isomorphic to the space of all families (a;;) such that 
> |aij|? < +00. Like any L?(X; 1) space, it is complete, even and especially 
if X =I x I is discrete... If Hj; denotes the operator mapping e; onto e; 
and and other basis vectors to 0, this clearly gives an orthonormal basis for 
L2(H), the a;; being the coordinates of A with respect to this basis. 

As any norm 1 vector « € H belongs to an orthonormal basis, ||Az||? < 
Tr(A* A) and so 


|All < Tr(A* A)? = || Al], if A is HS. 
Also 
| PAQl2 < ||P -All2-@ll 


for all P,Q € L(H). 

Exercise 4. Let be a positive measure on a space X and K(x,y), H(2, y) 
two square integrable functions with respect to 4 x ju, whence (exercise 1) 
HS operators 


Agfa) = f K(e.w)Fluduly 


and Ay on H = L?(X; 1). Show that 

Tr(AipAx) = ff Ke.) He wdnlo)dty) 
The Hilbert space £L2(H) is, therefore, canonically isomorphic to the L? space 
of ps X pL. 


(vii) Von Neumann algebras."® Von Neumann began his work on operator 
algebras with F. J. Murray by introducing several useful topologies, all of 
them defined by families of seminorms, on the space £(H) of continuous 


6 The content of this section will be rarely used before n° 31 of this chapter. 
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operators on H. They are the following ones, which impose more and more 
restrictive conditions on the notion of convergence. 

(a) The weak topology’ (or of simple weak convergence) is obtained by 
making the maps T +> Tz continuous with respect to the weak topology of 
H, in other words by making all functions T +> (T'z|y) in £(H) continuous in 
L(H). It is, therefore, defined by the seminorms T +> |(Tx|y)|. Hence every 
finite system of inequalities 


(Tx; — Axi|yi)| <i 


is an open neighbourhood of A with respect to the weak topology, and every 
open subset is the (finite or infinite) union of such open subsets. 

(b) The strong topology (or of simple strong convergence) is obtained by 
doing the bare minimum to make the map T’+> Tx continuous for all x € H. 
It is defined by the seminorms T ++ ||Tz||. A subset E of £(H) is a strong 
neighbourhood of a given A if and only if there are finitely many x; € H and 
r; > 0 such that F contains all operators T satisfying inequalities 


||Ta; — Ax|| << r;. 
(c) The ultrastrong topology is given by the seminorms 
pa(T) = ||T Alle 


for all Hilbert-Schmidt operators A. This time, the bare minimum is done to 
ensure that all maps T + TA from £(H) to £L2(H) are continuous. It can also 
be defined differently: any family (a;) of vectors such that S> ||a;||?_ < +00 
defines a seminorm 


Tt (© Pas?) 


on £L(H) and these are the seminorms which give rise to the ultrastrong 
topology. 

(d) The last and most obvious one is defined by the norm ||T'|, and is 
the one that we will most frequently use. Some authors call it the uniform 
topology (understood: of uniform convergence on the unit sphere of H), but 
I will often say convergence in norm. 

Studying the properties of algebraic operations in relation to the above 
topologies is important. Results are readily obtained, but risks of confusion 
cannot be ignored. Statement (0) is the most useful and simplest. 

(a) The maps (A, B) + AB and A+ A* are continuous with respect to 
theuniform topology (obvious). 


"7 Not to be confused with the weak topology on the Banach space £L(H): there 
are many other types of continuous linear functionals on this space apart from 
the functions considered here. See J. Dixmier, Les fonctionnelles linéaires sur 
Vensemble des opérateurs bornés d’un espace de Hilbert (Annals of Math., 51, 
1950) 
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(8) The map (S,T) ++ ST is unrestrictedly continuous only with respect 
to the convergence in norm. For example, considering the unitary operators 
U and V in L?(Z) given by, 


Uf(n)=f(nt+1), Vim) =f(n—1), 


it follows that U? f and V? f, converge weakly to 0 for all f as p increases 
indefinitely (exercise !), but U?V” = 1 does not converge weakly to 0. 
(y) The inequality 


||STa — SoTox|| < |[S\|. ||La — Tox|| + || SToa — SoTox| 


shows that the map (S,T) ++ ST is strongly continuous if S (or T) is forced 
to stay in a bounded subset of £(H). In particular, the map T > ATB is 
strongly continuous for given A and B. 

(6) The map T ++ ATB is weakly continuous for all A and B since 


(AT Baly) = (TBa|A*y) . 
It is also strongly continuous since, for all x € H, 
|| AT’ Ba — AT" Ball < ||Al]. |Z" - T") Bal] , 


and a similar calculation shows that it is continuous with respect to the 
ultrastrong topology. The map T' ++ T™* is weakly continuous since |(T*z|y)| = 
\(Tyle)|. 

These topologies are mainly useful in the theory of self-adjoint operator 
algebras. If A is such an algebra, the set A’ of operators commuting with 
all the T' € A is obviously also a self-adjoint algebra, containing the unit 
operator, closed with respect to the weak topology [because of property (6)] 
and perforce with respect to the others. It is called the commutator algebra 
of A. A’ induces a new self-adjoint algebra A” = (A’)’ containing A, its 
bicommutator, then (A”)! = (A‘)” and so on. Fortunately, the sequence is 
periodic since it is always the case that (A’‘)” = A’. 


Von Neumann density theorem. Let A C L(H) be a self-adjoint algebra. 
Then (A’)"” = A’. If A contains the unit operator, A” is the closure of A 
with respect to each topology, weak, strong, and ultrastrong. If A is closed 
with respect to one of these topologies, then A” = A. 


(i) The first step is to show that, for all T € A”, alla © H and all r > 0, 
there exists S € A such that ||Ta — Sal| < r. Now, the closure € of the set 
of Sa is a A-invariant vector subspace. As a result, the projection onto € 
is in A’ (lemma 3), hence commutes with the elements of A”, so that € is 
A” -invariant. However, € contains a since 1 € A. So Ta € €, and the result 
follows. 
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(ii) Let us now show that for all r > 0, T € A” and a; € H such that 
>= |\a:|]? < +00, there exists S € A such that 


(19.16) S > ||Ta;— Sail? <r, 


which will prove that A is dense in A” with respect to the ultrastrong 
topology. To do this, we consider the space H’ of x = (a;) such that 
> ||zi|/? < +00, equipped with the inner product (x|y) = S<(a;|yi). It is 
complete since it is actually the space of Hilbert-Schmidt operators. For all 
Aé L(H’), there is a matrix (A;;) with entries in £(H) such that 


y= Axe y=) Aya;. 
J 


The usual multiplication rule holds for these matrices if the order of the 
factors is respected. The matrices A;; cannot obviously be chosen arbitrarily 
unless, for example, they are all zero up to finitely many exceptions. 

Let us associate to each S € £(H) the “diagonal” operator 7(S) in L(H’) 
which multiplies each component of x by S. An operator A on H’ clearly 
commutes with these 7() if and only the matrices A;; commute with all S, 
i.e. are in A’. An operator B commutes with all these operators A if and only 


if 
S > Bir Any = >> Air Bry 
k k 


for all i,j and A;; € A’. Supposing all except one of these Aj; to be zero, 
one sees that B = 7(T) with T € A”, and conversely. In other words, 


m(A)” =T (A”’) 


Applying the result obtained in (i) to 7(A) and setting a = (a;) the conclusion 
follows that, for all T € A”, there exists S € A such that 


I|n(T)a—x(S)al] <r, 


which leads to (16) and to the fact that A” is the ultrastrong closure of A. 
(iii) Let B c A” be the weak closure of A. It is a self-adjoint algebra 
containing 1, which is weakly and thus ultrastrongly closed. As a consequence, 
by the result just obtained, B” = B. However, anything commuting with A 
commutes with B because of property (6) above, and conversely since B > A. 
Hence B’ = A’, and so B= B” = A”. 
(iv) If 1 ¢ A, then 1 € A’, and so A’ = (A’)", qed. 


In practice, one frequently needs to apply the previous theorem to self- 
adjoint algebras A not containing the unit operator. One can then apply it 
to operators T + Al, where T € A and 4 € C, but it would be good to be 
able to avoid additional scalars. The next lemma enables us to choose: 
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Lemma 9. Let A be a self-adjoint operator algebra on a Hilbert space H. 
A is everywhere dense in A” with respect to the ultrastrong topology if and 
only if 


(19.17) Ac=0 forall ACA=>c2c=0. 


(17) is obviously necessary. So suppose that (17) holds. 

The previous theorem tells us operators T + 1, where T € A, are ultra- 
strongly dense in A”. If one shows that the operator 1 is in the ultrastrong 
closure of A, the result will follow. 

Coming back to the space H’ in the proof of the density theorem, the 
proof reduces to showing that every a € H’ belongs to the closure € of the 
subspace of the set of vectors 7(T)a. To this end, let us write a = a’ + a” 
with a’ € € and a” orthogonal €. Since operators 7(T) map mcalE to €, 
they transform every vector orthogonal to € into a similar vector (lemma 3). 
The vector 7(T)a” is, therefore, orthogonal to €. But it is equal to Ta— Ta’, 
which is in € like Ta and a’. Hence 1(T)a” = 0 for all T, so that Ta’ = 0 for 
all components of a”. Thus, if condition (17) holds, then a” = 0 anda é €, 
qed. 


Self-adjoint algebras containing 1 and weakly (or strongly, or ultra- 
strongly) closed, i.e. von N’s rings of operators, are not now called von 
Neumann algebras."® They are characterized by the relation A” = A. 
A’ A” = A’'QA is clearly the centre of A, ie. the set of S € A such 
that ST = TS for all T € A. Von N’s entire theory is about algebras for 
which A’ A reduces to scalars, i.e. algebras he used to call factors and 
which he classified, a classification that has since been much improved. 


20 — Gelfand’s Theorems on Normed Algebras 


The set £(H) of continuous operators on a Hilbert space, or more gener- 
ally a Banach space, is an example of a complete normed algebra, a notion 
introduced by Gelfand in his famous 1941 article. Like everyone, I will fol- 
low it closely since attempts at simplification are bound to fail. This theory 
gave rise to much hope in the 1940s. It also showed its limits fairly quickly: 
one cannot prove everything without any explicit calculations or only using 
abstract nonsense, as Serge Lang calls it. 

This is the name given to any algebra’? A over C, with a unit element, 
equipped with a norm satisfying 


78 See Jacques Dixmier, Les algébres d’opérateurs dans Vespace hilbertien 
(Gauthier-Villars, 1957) andLes C*-algébres et leurs représentations (Gauthier- 
Villars, 1964) and, more recent and less concise, Richard Kadison and John 
R. Ringrose, Fundamentals of the Theory of Operator Algebras (AMS, 4 vol- 
umes, two of text and two of solved exercises, 1999-2001). 

79 An algebra A over a commutative field & is a ring with unit element equipped 
with the structure of a vector space over k such that Al.a2 = x.Al = Ax for all 
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(20.1) Ilvyll < lla 


and with respect to which A is a complete space, and hence a Banach one. 
Other examples occur readily: the algebra of bounded continuous functions on 
a topological space X, equipped with the usual multiplication and the norm 
of uniform convergence; the algebra of bounded measures on some lIcgG, 
equipped with the convolution product and the usual norm. And any closed 
subalgebra containing 1 of a complete normed algebra is also a complete 
normed algebra. If for example H is a Hermitian operator on a Hilbert space 
H then considering the set A of continuous operators that are limits, with 
respect to the usual norm, of polynomials 5° a,H” in H, we get a complete 
normed algebra which, as we will see, is isomorphic to the algebra of contin- 
uous functions on the spectrum of H. The spectral decomposition of H will 
follow easily from this result. 

The spectrum of « € A, defined to be the set Sp(x) of ¢ € C for which 
x — ¢ is not invertible®® in A generalizes the notion of the spectrum of an 
operator. If P is a single-variable polynomial with complex coefficients, the 
relation 


P(x) — P(¢) = ( — ()Q(x) = Q(2)(z — ¢), 
where Q is also a polynomial, shows that 
¢ € Sp(x) => P(C) € Sp[P(x)] . 
Indeed, if P(x) — P(¢) has an inverse y, then 
yQ(x)(% — ¢) = (w@— C)Q(x)y = 1, 


so that x — ¢ is invertible. 


Lemma 1. Let A be a complete normed algebra. For alla € A, the spectrum 
of x is a non-empty compact set contained in the disc |¢| < ||z]]. 


To prove this, one sets that the geometric series }> x” converges for |||] < 
1. Since calculations already encountered in Chap. II show that 


(l-a)14+e+..J=1, 
it follows that 1— x is invertible for ||z|| < 1 and that 


(l—2z)t=l+ete't.... 


A € kand alla € A. A1 is usually shortened to A. Examples: the algebra M,,(k) of 
k-valued functions on a given set, etc. Finite-dimensional algebras over Q have 
long been studied using methods totally unrelated to those we are interested 
in here, and that are far more difficult; see André Weil, Basic Number Theory 
(Springer, 1967). 

8° An element x of a ring is invertible if yx = 1 and xz = 1 have solutions. Then 
y=z=n. 
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For arbitrary x and ¢ £0, 


C-x=C(1—a/¢), 


so that left hand side is invertible for |¢| > |||], which shows that the spec- 
trum of x is contained in the disc of radius ||«||. If ¢ — 7 = y is invertible and 
if h € C, then 


y+h=y(1+hy"') 


is invertible for sufficiently small |h|, which shows that ¢’ — x is invertible for 
¢’ sufficiently near ¢. The complement of the spectrum is thus open, and the 
spectrum is compact. 

The function f(¢) = (¢ — x)~+ is defined on the open set C — Sp(z). It 
is analytic on it since standard calculations show that, like in C, f(¢ + h) 
is a power series with coefficients in A for the variable h € C, when |h| is 
sufficiently small. For |¢| > ||], 


(20.2) Gea) SC" Se) SO" es 


from which it follows that f(¢) tends to 0 at infinity. 

If the spectrum of x was empty, then f would be an analytic function on 
all of C, tending to 0 at infinity. Liouville’s theorem would then show that 
f = 0, which is absurd, qed. 

This supposes a generalization of Liouville’s theorem to functions with 
values in a Banach space H. In fact, the entire theory of analytic or holo- 
morphic functions (and much more: derivatives and differentials of functions 
of real variables, integrals, Fourier series and integrals, etc.) is valid in this 
framework, the proofs being almost the same as in the standard case. For 
example, an analytic function is holomorphic, i.e. has has a derivative 


f'(¢) =lim[F(6 +h) — FO] /h, 


and actually f(¢ +h) = So fp(¢)hl” implies that f’(¢) = So fp(Q)alr-4 
with coefficients f,(¢) € H. The path integral of a holomorphic function wath 
values in a Banach space can be defined as in Chap. VIII because, since n° 
4 of the present chapter, we know how to integrate vector-valued continuous 
functions. Invariance under homotopy is not a problem, neither is Cauchy’s 
integral formula for a circle [Chap. VIII, n° 4, (iii)] or a more complicated 
contour. It follows that a holomorphic function f is represented in a disc 
centered at some a where it is defined by its power series at a € C, hence in 
all of C if f is entire. Then Liouville’s theorem follows as in Chap. VII, n° 18 
by taking upper bounds of the coefficients of the power series using Cauchy’s 
formula for a circle. 


Lemma 2. If every non-trivial element of a complete normed algebra A is 
invertible, then A has dimension 1. 
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Indeed, for all x € A, there exists ¢ € C such that x — ¢ is not invertible, 
whence x — ¢ = 0. 

In a ring A with unit element, a left ideal is an additive subgroup I of A 
for which 


cel & yEeAS=yead. 


Right and two-sided ideals can be similarly defined. If A is commutative, one 
talks of ideals. For example, in Z, any ideal is the set of multiples nZ of a 
given integer n, namely the smallest n > 0 such that n € I. In £L(H), where 
H. is a vector space, the operators with a given x € H as a zero form a left 
ideal. In the ring of continuous functions on a space X, the functions with a 
given point as a zero form an ideal. Etc. 

In a commutative ring A, the set Az of multiples of a given x € A is an 
ideal. It is equal to A if and only if x is invertible. A commutative ring A is, 
therefore, a field if and only if its only ideals are {0} and A. 

A left ideal J Cc A is said to be maximal if I 4 A and J is not strictly 
contained in any other left ideal apart from A. In Z, this means that I = pZ 
for some prime p. 


Lemma 3 (Gelfand-Mazur). Let I be a maximal ideal of a commutative 
complete normed space A. Then dim(A/I) = 1. 


Indeed, the quotient of a commutative ring by an ideal J can be consid- 
ered to be a commutative ring by observing that, for 7,y € A, the coset of 
zy mod I only depends on those of I: if a,b € IJ, then indeed 


(c+ a)(y+6)=ary+ay+arb+abeay+I 


because of commutativity. The inverse images under the ring homomorphism 
p: A —> A/T of the ideals of A/T are obviously the ideals of A containing 
I. As a result, an ideal J # A is maximal if and only if A/T is a field. 

But if A is a complete normed space, the elements x such that ||a~—1|| < 1 
are invertible. Hence any non-trivial ideal has trivial intersection with this 
open ball, so that the closure of a non-trivial ideal is again a non-trivial ideal. 
Any maximal ideal is, therefore, closed. 

But then the quotient A/J is also a complete normed space with respect 
to the norm 


(20.3) IIp(x)|| = inf ly 
p(y)=p(«) 

which can be more generally defined on the quotient of Banach spaces by 
closed vector subspaces. The outcome is again a Banach space since any 
Cauchy sequence in the quotient is the image under p of a Cauchy sequence 
in the original space. Relation (1) is also be readily verified to hold in A/TJ, 
which thus becomes a commutative complete normed algebra without any 
non-trivial ideals. All its non-zero elements are, therefore, invertible. The 
result now follows from lemma 2 applied to this algebra. 
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The question of the existence of maximal ideals can be radically solved: 


Krull’s Theorem. Every left ideal I A A in a ring A with unit element is 
contained in at least one maximal left ideal. 


This is natural more difficult than proving the existence of prime divisors 
in Z. If A is a finite-dimensional algebra over a field, the result follows imme- 
diately from a dimension argument. This is also the case if A is a Noetherian 
commutative ring (i.e. in which every increasing chain of ideals is stationary), 
but such objects are not encountered in functional analysis. In the general 
case, one is forced to use methods based on transfinite induction.®! This is 
also the case when, for example, proving the Hahn-Banach theorem. Krull’s 
theorem is extraordinarily simple, as general as can be and above all is useful 
outside functional analysis. It is therefore better to admit it rather than in- 
venting ad hoc arguments for separable normed algebras as Dieudonné does 
(15.3.4.1). 


Theorem 32. Let I be a maximal ideal of a commutative complete normed 
space A. Then there is a linear functional x on A such that x(1) = 1, 


(20.4) x(ty) = x(z)x(y) for z,yeA 


and for which I is defined by the relation x(x) = 0. For every solution of (4), 


(20.5) lx(x)| < |la|| for all ae A. 


Since A/I has dimension 1, for all « € A there is a unique scalar y(z) 
such that 


x—x(x).1leTl. 


The multiplicativity formula follows trivially. So does the characterization 


of x € I. Conversely, any non-trivial solution of (3) clearly defines an ideal 
ITAA. Asax— y(x).1 € I cannot be invertible, x(a) belongs to the spectrum 
of x, which gives |y(x)| < ||z|] and the continuity of x. 


8! The universal tool to prove this type of result is Zorn’s theorem, which we now 
state. Consider a set FE equipped with an order relation x < y. Suppose that 
every totally ordered subset F (i.e. x < y or y < x always holds in it) of E has 
a supremum in EF: there exists M € FE such that, for x € E, 


a>y foralyeFour>M. 


Then FE has at least one maximal element x, i.e. such that y > x implies that 
y = «. Krull’s theorem follows by taking E to be the set of left ideals J D I 
distinct from A, ordered by inclusion. For a totally ordered subset F' of E, the 
union of J € F, distinct from A as they do not contain 1, is again an ideal and 
is obviously the supremum of F in E. 
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Non-trivial solutions of (4) are called the characters of A. The spectrum 
of A, ie their set, will be written A. To each, x € A associate the function & 
on A, the Gelfand transform of x, given by 


(20.6) @(x) = x(2). 


The map x +> @ is, therefore, linear and satisfies 


_—_ 


(zy) = ay, |[2|| < lal, 


where ||Z|| is the uniform norm on A. Furthermore, the spectrum of x is 
the set of values of the function Z. Indeed, x € A is not invertible if and 
only if the ideal Ax of multiples of x is distinct from A, hence is contained 
in a maximal ideal. So ¢ belongs to the spectrum of x if and only if there 
exists y such that y(x) = ¢, proving the result. In conclusion, the function 
f(¢) =(€ — 2)? is defined and holomorphic on |¢| > ||@]|. 

The inequality ||Z|] < ||x|] can be made more precise: 


Lemma 4. For allxé A, 


(20.7) \|Z|| = lim Ja” ||/". 


For all n and character x, |x(z)"| = |x(a")| < ||a"||. Thus |xy(z)| < 
lz” ||1/” for all x and so 
(20.8) ja” ||2/" > ||2I| for all n. 


On the other hand, the function f(¢) = ¢(¢ — 2)~! = (1—2/¢)~! is defined 
and holomorphic for |¢| > ||Z||, hence has a Laurent series expansion in this 
open subset. However, f(¢) = > a"/¢” for |¢| > ||a||. This series, therefore, 
converges for |¢| > ||Z|| (uniqueness of the Laurent series). For |¢| = ||%|| +r 
with r > 0, its terms tend to 0, and so ||a”|| < |¢|". Hence ||x"||!/" < ||2|| +r 
for large n. This together with (8) proves (7), including the existence of the 
limit. 

There is a natural topology on A, which is obtained by doing the bare 
minimum to make the functions % continuous. This means that (a) for all 
xo € A, x €Aandr > 0, the inequality 


Iz(x) — 2x0) <7, 
i.e. 
(20.9) Ix(x) — xo(x)| <r, 


must define an open subset of A, (b) a subset of A is open if and only if it 
is the union of sets defined by finitely many inequalities of the form 


Gti cee Cece paces 
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As stated above, this is the weak topology, which can be defined on the dual 
of any topological vector space.®? 


Lemma 5. Let H be a Banach space, H’ the space of continuous linear 
functionals on H and B’ the unit ball ||f\| <1 of H'. Then B’ is compact 
with respect to the weak topology. 


The proof is easy provided any (finite or infinite) Cartesian product of 
compact spaces is admitted to be compact, which is another application of 
Zorn’s theorem mentioned above in a footnote. This being admitted, let D 
be the disc |z| < 1 of C and B the unit ball ||z|| < 1 of H. To each x € B, let 
us associate a copy D, of D and let 92 the Cartesian product of the sets D,: 
it is therefore the set of families ¢ = (¢,) of elements of D, equipped with 
the minimum topology making the maps ¢ +> ¢; from 2 to D continuous. 
As D is compact, so is 2.5% This done, let us associate to each f € B’ the 
element ¢ = j(f) of 2 given by ¢, = f(x) for all a € B. The map j from B’ 
to 2 is obviously injective, and is a homeomorphism from B onto its image 
since in both cases, we did the bare minimum to make the maps f # f(x) 
and ¢ + ¢, continuous. Hence the proof reduces to showing that 7(B’) is 
closed in 9, i.e. that if a linear functional f € B’ varies in such a way that 
lim f(a) = g(a) exists for all x € B, then g is the restriction of a continuous 
linear functional g € B’ to B. This is clear since the limit of a sum is the sum 
of the limits and since taking limits preserves the inequality | f(x)| < |||. 

If we come back to the normed algebra A, the space A is obviously con- 
tained in the unity ball of the topological dual A’ of A. It is closed with 
respect to the weak topology since, by definition of the weak topology, for 
given x and y, both sides of the equality y(xy) = y(x)x(y) are continuous 
functions of x. Taking limits also preserves equality y(1) = 1. In conclusion, 
the space A is compact. 


8°? Even more generally, taking a set X, a family (F;) of topological spaces and a 
family (fi) of maps from X to Fi, there are topologies on X with respect to 
which the maps f; are continuous. With respect to such a topology, the sets 
f, ‘(Ui), as well the intersection of finitely many such sets, must be open for all 

i and all open subsets U; of F;. The simplest is then to stipulate that a subset of 

X is open if and only it is the union of such intersections. Checking the axioms 

reduces to the calculation rule on sets of Chap. I. With respect to this topology, 

a varying x € X converges to a limit a € X if and only if lim f;(x) = fi(a) for 

all 7. For example, if X is a set of linear functionals on a real or complex vector 

space H, the weak topology on X is obtained by applying the previous procedure 
to the functions f > f(x), « € H. If X is the Cartesian product of a family of 
topological spaces X;, the product topology on X is obtained by choosing the 

projections x +> x;, onto the factors X;. 

If H is separable and if (an) is a sequence everywhere dense in the unit ball B 

of H, then the weak topology on B’ (but not on H’) can be defined using the 

functions f +> f(an), with values in D. This defines a map f +> (f(an)) from 

B’ to the Cartesian product D', whose compactness has been proved without 

applying Zorn’s theorem in n° 11, (i) about corollary 2. 
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21 — A Characterization of Algebras of Continuous Functions 


Let X be a compact space and A = L(X) the normed algebra whose elements 
are the continuous functions on X, equipped with the norm 


(21.1) II f|| = sup |f()| 
of uniform convergence. If 
(21.2) f*(x) = f(a), 


denotes the conjugate function of a function f € A, relations (19.5), (19.6) 
and (19.7) obviously hold. I will use the terminology GN algebra (following 
Gelfand and Neumark, 1943) rather than “C*-algebra”, like many authors 
do, for a complete normed algebra A equipped with an involution 1 WH «* 
satisfying relations: 


(21.3) eH oer See. a Sas 
(21.3’) I|a* || = llal| , le*arl] = [all . 


The Gelfand-Naimark theorem affirms that, for a commutative GN algebra, 
the map « + & from A to the algebra L(A) of continuous functions on 
the spectrum of A, in other words the “Gelfand transform” (20.6), is an 
isomorphism no matter how you look at it. This means that it is a ring 
homomorphism — obvious -—, that it transforms the given norm ||z|| into the 
norm (1) of the function Z, that it transforms the given involution 7» «* 
into the involution (2), and finally that it is bijective. 

The proof is simple but very ingenious. Its authors were quite obviously 
inspired by what was already known about operators on a Hilbert space, and 
in particular by lemmas 4 and 5 of n° 19. In what follows, we shall set X = A. 

We start by observing that, if « € A is invertible, so is x*. Since (¢—x)* = 
¢—a*, the spectrum of x* is the image of the spectrum of x under ¢ 4 ¢. 

Secondly, the spectrum of every x such that x = x* is real. As in the case 
of a Hermitian operator, it suffices to show that x — 7 is invertible. Suppose 
this is not true. As « —i = (a — Gi) — i(¢ — 1), i(¢ — 1) € Sp(a — ¢2) for all 
¢ €C. Hence |¢ — 1| < ||a — Gil], and so, for any real ¢, 


(¢- 1)? < lla — Call? = Il(@ — ¢é)*(@ — ¢a)I] = lz? + C7 || < lel? +07, 


which contradicts the first order binomial theory. 
Since any x € A can be written 2 = x’+ix” with Hermitian x’ = $(2+2*) 
and xv” = (x — «*)/2i, and so x* = a’ — ia", more generally, 


(21.4) x (2") = x(x) 


for any character of A. 
The image of A in L(X) under x +> @ is, therefore, stable under conjuga- 
tion. This image is a subalgebra of L(X). It separates the points of X since 
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the relation y(x) = (2) is satisfied by all x only if y = x’. The Stone-Wei- 
erstrass theorem then shows that x ++ Z maps A onto an everywhere dense 
subspace of L(X). 

To show that the image of A is equal to L(X), it then suffices to show 
that it is closed, and for this purpose to use the last relation 


Ill] = aI 


which needs to be proved. Since, by assumption ||xz*z|| = ||a||/? and, by (4), 
|x (x*x)| = |x(x)|?, it suffices to prove it for x*z, i.e. for Hermitian 2. However, 
we know (n° 21, lemma 4) that: ||2|| = lim ||x”||!/". Since 2” is Hermitian for 
all n, by (4), 


le*l] = Me? Well = [oP IF = Weel? 


npr 


etc. So for n = 2°, ||x = ||z||, whence the theorem: 


Theorem 33. Let A be a commutative GN algebra. Then the Gelfand trans- 
form x &, defined by 


is an isomorphism from A onto the GN algebra of continuous functions on 
the spectrum A of A. 


A corollary of this result is that, if f(¢) is a complex-valued function 
defined and continuous (but not necessarily analytic!) on the spectrum of 
some x € A, then there is a unique y € A such that 


GO) = F[@Od] for all x : 


it suffices to check that the right hand side is a continuous functions on 
X = A. We set y = f(x). The spectrum of y is the image under f of the 
spectrum of x, and ||y|| = |||], where this is the uniform norm on Sp(x). If 
f =g+hor gh, then f(x) = g(x) + g(h) or g(x)h(x). Composing f with a 
continuous function g on the image of Sp(x) under f gives go f(x) = g[f(x)], 
etc. 

Let us for example consider the algebra A of bounded continuous complex 
functions on a topological space E, with the obvious norm and involution. 
Every x € F defines a character y, : f +» f(a) of A which, by definition, is 
a continuous function of x with respect to the topology of A. This gives a 
continuous map from E to the Cech compactification A of E. It is a compact 
space which in general is far too gigantic to be of any interest other than 
theoretical. In particular, it is not metrizable, even for FE = N. 

But if & is compact, in which case B(E) = L(E), the embedding of E 
into X = A isa homeomorphism. As it is continuous, it suffices to show 
that it is bijective. Injectivity is equivalent to saying that, for given x # y, 
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there exists f € L(E) such that f(a) 4 f(y). This is obvious. Surjectivity is 
equivalent to saying that, for every character x of L(E£), there exists 2 such 
that y(f) = f(x) for all f € L(£). However, let I be the ideal x(f) = 0. If 
there exists « such that f € J, the linear functionals f +> x(f) and f > f(z) 
have the same kernel, hence are proportional and in fact identical since these 
are characters, proving the result. So it suffices to show that all the functions 
f € JI vanish at some point of X. Otherwise, for all x € X, there exists f € I 
which does not vanish at x and hence in the neighbourhood of x. By BL, we 
get finitely many f; € I such that the open sets {f;(x) 4 0} cover E. The 
function g = > f* fi = >| fil? is again in J, it is > 0 everywhere, and so 
invertible in A, a contradiction since I £4 A. 

Exercise 1. Let E be a locally compact space and A the algebra of con- 
tinuous functions on £ which tend to a finite limit at infinity. Equip A with 
the obvious norm and involution. Using the same method, show that A is 
the Alexandrov compactification EU {oo} of E. 


22 — Spectral Decompositions 


(i) The GN algebra of a normal operator. Let H be a Hilbert space and 
A c L£(H) a GN algebra, i.e. satisfying the following conditions: A is self- 
adjoint, i.e. JT € A implies T7* € A, A is closed with respect to the topology 
defined by the norm of the operators. Let us suppose that A is commutative. 
The elements of A then satisfy N*N = NN*. This property characterizes 
normal operators, which were studied by von Neumann around 1927 in the 
context of Hilbert spaces. In finite dimensional, they are diagonalizable, for if 
¢ is an eigenvalue of N, the corresponding eigenspace H(C¢) is invariant under 
N and N* since N and N* commute, hence so is its orthogonal complement 
(n° 19, lemma 3), and diagonalization follows by induction on the dimension 
of H. 

Starting from a normal operator N and considering operators that are 
limits (in norm) of polynomials 


(22.1) P(N,N*) = 50 apgN?N*, 


one gets an algebra A(N) = A of the former type, hence isomorphic to 
the algebra of continuous functions on the spectrum of A. It can be easily 
determined: 


Lemma. Let N be a normal operator on a Hilbert space H. Then the map 
x x(N) is a homeomorphism from the spectrum of A(N) onto the spectrum 
of N in L(H). 


For a character x of A = A(N), clearly 


(22.2) x (P(N, N*)] = P [x(N), XCN)| 
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since y(T*) = y(T) for all T € A. As the linear functional is continuous, it 
is fully determined by the number ¢ = y(N). This gives a continuous injective 
map (hence a homeomorphism) y +> y(N) from the compact space A onto 
a compact subspace of C, namely the spectrum S of N in A as was seen in 
n° 20 for every normed algebra. Thus A(JV) is isomorphic to the algebra of 
continuous functions on S by the GN theorem. Let o be the spectrum of N in 
L£(H). Clearly o C S. To prove the inverse inclusion, it suffices to show that if 
N —C is invertible as an operator on H, then its inverse is in A. Replacing N 
by N —¢, the proof then reduces to showing that if N is invertible in L(H), 
then N—! is the limit in norm of polynomials of N and N*. This property is 
special to normal operators. 

To prove this, we consider the set B = A(N, N~?) of limits in norm of 
polynomials of N, N~! and of their adjoints. It is again a GN algebra. As 
above, the map x +> x(V) from the spectrum B of B to C is clearly injective 
and continuous, and so is a homeomorphism from B ontoa compact subset 
of C*, and the map f +> f(N) is an isomorphism from the algebra L(/¢) onto 
B. The elements N, N~* and N* of B correspond to the functions ¢,1/¢ and 
¢ on K. However, the function 1/¢ is the uniform limit on K of polynomials 
in Cand ¢ (Stone-Weierstrass). As f ++ f(N) preserves norms, N~! is the 
limit of polynomials of N and N%*, qed. 

In fact this proves that A(N,N~+) = A(N) if N is an invertible normal 
operator in L(H). An immediate consequence of the previous lemma is that 
if N belongs to a closed self-adjoint subalgebra A of £(H), commutative or 
not, then the spectrum of N in A is the same as its spectrum in £(H), in 
other words does not depend on A, since if, for example, N is invertible as an 
operator, then N~! € A(N) c A. Hence it is possible to talk of the spectrum 
of a normal operator without mentioning the GN algebra considered. 

We saw in n° 19, lemma 5 that, for a Hermitian operator H, the smallest 
compact interval of R containing the spectrum of H is [my, My]. If H is 
positive, i.e. if mz > 0, the function f(¢) = ¢'/? is defined and continuous 
on this interval. Setting H!/? = f(H) gives a Hermitian operator such that 


(22.3) (HA) EES, 


These properties fully determine H!/? among all Hermitian operators in H. 
Indeed, let H’ be a positive Hermitian operator such that (H’)? = H. The 
algebra A(H"’) is isomorphic to L(S’), where the spectrum S” of H’ is con- 
tained in Ry (n° 19, lemma 5). A(H’) contains H, hence also H'/? € A(H) Cc 
A(H"). The continuous functions corresponding to H and H’ in L(S’) are 
CH C2 and ¢ + ¢. The one corresponding to H!/? is positive-valued since its 
values are the elements of the spectrum of H!/?. It is, therefore, the positive 
square root of the function ¢ +> ¢? on $’, hence the function ¢ +> ¢; thus 
H}/2 = H', qed. 

Exercise 1. Let H and H’ be two commuting positive Hermitian opera- 
tors. Show that their square roots commute. Deduce that HH’ is positive 
Hermitian. 
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Exercise 2. Let B be the set of Hermitian operators H such that 0 < 
H <1. (a) Show that, for all r > 0, there is a real polynomial p(¢) such that 
|| H‘/? — p(H)|| <r for all H € B. (b) Let H, € B be a sequence of operators 
strongly converging to some H € B [for example, an increasing sequence: 
n° 19, (v), lemma 8]. Show that H,/ strongly converges to H*/? [first show 
that, for any polynomial p, the map H ++ p(#) is strongly continuous on B}. 

Exercise 8. For a positive Hermitian operator H, define a positive Hermi- 
tian operator H*, where s € R and show that H*H*t = H***. Show that the 
function s + H* extends to a holomorphic function on C — R_ and that the 
set of H*’, t CR, is a group of unitary operators. 


(ii) Spectral measure of an operator algebra. Let us now consider the gen- 
eral case of a commutative GM algebra A in £(#) and denote its spectrum 
by X = A; it is a compact space. Conversely, the Gelfand transform T > T 
being bijective, one can associated the operator T = M(f) € A defined by 
the condition that 


) 
I 
SY 


to each f € L(X). Relations 


(22.4) M(f+g9)=M(f)+M(g), MGP)=M(f), 


as well as 


IM (AYN = INF 


trivially hold. The map fH M(f) from L(X) to A will be called a spectral 
measure of A. It is linear and transforms every real (resp. positive) func- 
tion into a Hermitian (resp. positive Hermitian) operator. The characteristic 
of these “measures” is the relation M(fg) = M(f)M(g): the only usual 
numerically-valued measures satisfying it are the Dirac measures. 

The isomorphism f ++ M(f) transforms every continuous linear func- 
tional on A into a continuous linear functional on D(X), i.e. into a measure 
on X. This the case of the map 


T+ (Taly) 
for all x,y € H. This gives a unique measure uz, on X such that 


(22.5) (M(f)zly) = tey(f) = ip Odile OO 


for all f € L(X) or, using a different notation, 


(ely) = / PCRS) 


for all T € A and x,y € H. These measures satisfy fairly obvious properties: 
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(MS 1) the map x+> pz, is linear ; 

(MS 2) the measures ply, are fz,y are imaginary and conjugate ; 

(MS 3) the measures fix. are positive and of total weight ||z||? ; 

(MS 4) duray(x) = T(x)dte,y(x) for all Te A; 

(MS 5) An operator T commutes with the elements of A if and only if 
(22.6) Lbre,y = be,r+y forall «,yeH. 


Property (MS 2) follows from the observation that, for real f € L(X), 
the operator M(f) is Hermitian, so that (M(f)aly) = (M(f)y|x), which can 
also be written Uz,y(f) = My,c(f). Hence the result. (MS 3) says that the 
operators M(f) are positive Hermitian for f positive. So (M(f)a|x) > 0. As 


Hew (A) = (Aaly)| < (PI ell lll 


and ||M(f)|| = || fll, in fact 


I[éeull S [lar [ly - 
(MS 4) follows from the fact that, for f,g € L(X), 


Lu (fxy(9) = (M(g)M(f)aly) = (M(fa)aly) = Hey(F9) - 


(MS 5) follows from relations 


Mroy(f) =(M(f)Taly), (CM (f)aly) = (M(f)2|T*y) = we,rey(f)- 


(iii) Integration with respect to a spectral measure. Instead of starting 
from a spectral measure associated to a GN algebra, more generally, it is 
possible to define a spectral measure for any locally compact space X as a 
map M: fH M(f) from L(X) to the algebra £(H) of a Hilbert space H, 
which must satisfy above conditions (4). By (5), one then gets measures tz, 
and they clearly satisfy (MS 1) and (MS 2). They also satisfy (MS 4) under 
the form 


dum(f)x,y (t) = f(t)dpx,y (t) 


for f € L(X), and (MS 5) for operators T commuting with all M(f). The 
fact that fiz,. > 0 is obvious. ||M(f)|| < || f|| always holds since, for |z| > || fl, 
there exists g € L(X) such that (f + z)(g + 27!) = 1. This relation implies 
[M(f)+2][M(g)+2—'] = 1 and shows that the spectrum of M(f) is contained 
in the disc of radius || f||, whence the result. One then readily sees that the 
measures jlz,, are bounded and that 


\léx,cl| = sup fe,x (|f|?) = sup || M(f)2\|? < |lal|?. 
\fl<1 
fEL(X) 
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Relation plx,.(X) = ||a||?, which implies pz,,(X) = (xly), nonetheless re- 
quires an additional assumption when X is not compact; it will be given 
later. The Cauchy-Schwarz inequality 


(22.7) lMx,y(f) : < Meo (f)My,y(F) 
shows that fiz, is bounded for all x and y and that 


[Haul] S [lar lly - 


When there is measure, even if it is with values in £(H), continuous 
functions are not the only ones that are integrable. A function y on X will 
be said to be M-measurable (resp. M-integrable) if it is measurable (resp. 
integrable) with respect to all measures fz,,, which is the case of Borel 
functions, if need be bounded to ensure integrability. A set N Cc X will even 
be said to be M-null if ua,.(V) = 0 for all x, and so pz,y(N) = 0. The map 
(x,y) fx, having all the formal properties of a Hermitian form — except 
that its values are measures —, the relation 


A(zly) = (x@+yla+y)—(e-yle—y) +... 


indeed shows that the uz,, are linear combinations of positive measures [1z,z. 
For a M-integrable function y, the number juz,,(y~) depends linearly on x 
by (MS 1) and semi-linearly on y by (MS 2). Moreover, as 


How (P)] S [eevll lel! S llell-lell-lyll, 
there is a unique continuous operator M(y) on H such that 


(M(¢)2|y) = May (Y) 


for all x, y (n° 19, corollary of theorem 33). The map y++ M(y) also satisfies 
relations (4) as we are now going to see. 

The first two are obvious by (MS 1) and (MS 2). To prove the third one, 
for which theorem 25 on the definition of measures by a density is essential, 
let us begin with an “arbitrary” function y and a continuous function g. 
Applying (M 5), 


(M(yg)2ly) = / p(t)g(t) djs y(t) = / v(t).g(t)de y(t) 


- i: v(t)dpar(gyey(t) = (M(g)M(g)aly) , 


whence M(yg) = M(y)M(qg). Taking adjoints, M(y)M(g) = M(g)M(y) 
follows since 


f sdursceren = (M(g)M(¢)zly) = (M(~)M(g)aly) 


= [ etestioen = f e.odiey by (MS 4) 


= f e9dus.y = f aed: 
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Thus 


(22.8) dum (y)2.y(t) = p(t)dpr.y(t) 5 


which is the analogue of (MS 4) for y. Now if ~ is an “arbitrary” function 
on X, then theorem 25 shows that 


(M(wW)M(e)aly) = barony ey b.0die y 
i. by.diny = (M(wy)aly) . 


So we finally get M(yw) = M(y)M(v). 

The latter shows that M(y) = M(1)M(v) for all y, and (4) that M(1) = 
M(1)* = M(1)?. As a consequence, M(1) = E is the projection onto a 
closed M(y)-invariant subspace H’ of H mapping the orthogonal complement 
H' onto 0. Obviously, H’ = H if X is compact. This is the case of the 
spectral measure of a GN algebra. M is then said to be non-degenerate. As 
bee(X) = Me,«(1) = ||M(1)2||?, condition 


He,e(X) = ||2|? 


says that M is non-degenerate. Otherwise, #1 may as well be replaced by H’. 
Hence in what follows, M will be assumed to be non-degenerate. 

Likewise, a trivial calculation using (MS 5) shows that M(y) commutes 
with every operator commuting with M(f), for all f € L(X), in other words, 
belongs to the von Neumann algebra generated by all M(f) [n° 19, (vii), 
density theorem]. 

Lebesgue’s theorems lead to limit properties for operators M(v). First of 
all 


d 


(22.9) || M(y)a|? = (M(v)2|M(¢)x) = (M(v)*M(y)a|x) = 


= / v(t)? ditze(). 


So if a sequence (y,,) converges M-almost everywhere to a function y, relation 


IM (y)x — M (gn) all? = / vt) — Galt)? dpte,a(t) 
shows that 
(22.10) lim M (yn)c« = M(y)a forall reH 


provided (y,) converges in mean to y in all L?(jz,.) spaces, for example if 
the functions y, are dominated by a bounded Borel function. Theorems on 
increasing series and sequences can be similarly interpreted. 

We next consider a M-measurable set w C X and let f be its characteristic 
function. Then f = f = f?, and so 
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As a result, M(f) = M(w), traditionally written E(w), is an orthogonal 
projection operator onto a closed subspace H(w) of H. By definition, 


(22.11) (M(e)cly) = f ditey(t) = Hay) 

and in particular, M(w) being a projection, 

(22.12) | M(w)al|? = (Mw)2|z) = pa2,o(w). 

Therefore, as x € H(w) is equivalent to ||/M(w)a|| = ||a||, hence to 
lla? = Hex (w) , 

and as [x,¢(X) = |la||?, one sees that 

(22.13) xe H(w) => Ue 2(X —w) =0. 


The H(w) are the spectral manifolds of the measure M (or of the algebra A 
when M is associated to A). 

Let us for example suppose that w = {xo} for some yo € X. Elements 
x € H(w) are then characterized by the fact that 


(22.14) Tx=xo(T)x« forallTeA. 


Indeed, these x are characterized by pix,2(X — {xo}) = 0 or, if y denotes the 
characteristic function of w, by the fact that T(x) — xo(T) v(x) = V(x) is 
zero almost everywhere with respect to Uz... As 


M(b) =T — xo(T)M (yg) = T — xo(T)M (w) 


and as plex (W) = f W(x) |?dttre(x) = ||M(~)a|]?, the result follows. 

The map w ++ M(w) has properties analogous to those of a measure and 
others without any parallel. First of all, for any finite or countable family 
(wy), relation (13) shows that 


(22.15) HW (N wn) =H (wn) - 


If the sets w, are pairwise disjoint, the corresponding projections cancel pair- 
wise since M(yw) = M(vy)M(w). So the subspaces H(w,) are pairwise or- 
thogonal. Moreover, H (LJ wn) is the Hilbert direct sum of the spaces H(w,). 
Indeed, let y, be the characteristic function of w, and y = >> yp that of 
w =Uwry. For all e € H, 


M(w)a = lim [M (w1)a+...+ M (wn) 2] 
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by (10) applied to the sequence yi +...+ Yn. The vectors t, = M(w,)x are 
pairwise orthogonal. If « © H(w), then M(w)a = a, and so 


x =lim (#1 +...+2n) 


and ||x||? = >> ||an||?. Conversely, taking x, € H(wn) such that Y> ||rp|l? < 
+oo, the limit x of the partial sums exists and M(w)s is the limit of the 
partial sums of the series }> M(w)a,. Now, M(w)ap, = Xp since 


In = M (Yn) tn = M (~Gn) tn = M(p)M (Yn) tn = Mw) on. 


The projections M(w) can be zero; but, for a GN algebra, M(w) 4 0 if 
w is a non-empty open set. Indeed, there is then a continuous function f on 
A which is not identically zero, but vanishes outside w. As it is equal to its 
product with the characteristic function of w, M(f)M(w) = M(f). Since f 
is continuous, the operator M(f) = A is in A and like f, is not zero. Hence 
M(w) £0. 

This shows that, 7f A does not only consist of scalar operators,, i.e. if the 
spectrum A of A has at least two points, then there exist spectral manifolds 
other than {0} and H: indeed, there are two non-empty disjoint open subsets 
w and w! in A, hence the that result since then H(w) and H(w’) are non-zero 
and orthogonal. 

An integral may be calculated approximatively using Lebesgue sums. Sim- 
ilarly, let us consider for an arbitrary spectral measure M, a M-integrable 
function y and, for given r > 0, let us partition X into finitely or countably 
many M-measurable sets w,, on each of which ¢ is constant, up to r. Let yy, 
be their characteristic functions. Choosing tn, € wn, 


let) — 5) (tn) en(t) <r for all t 
and so 


(22.16) |4(e) — 2 En) M (on) 


<r. 


This result justifies or explains the spectral decomposition formula 


(22.17) M() = | y(t)dM (t) 


n~ 


which is merely notation. For an algebra A and y € L(A), the operator 
M(y) = Aisin A and ¢(x) = A(y). So (17) shows that in particular 


217) A= [ Acoamn for all AE A. 


The fact that each A € A can be approximated by linear combinations of 
projections M(w) shows that a continuous operator T on H commutes with 
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all A € A if and only if it commutes with the projections M(w), in which 
case it commutes with M(y) for any M-measurable bounded function y on 
A. 

This result has an easy but important consequence, another version of 


which will be given in n° 23. 


Schur’s Lemma* I. Let A be a self-adjoint subalgebra of L(H) containing 
the unit operator. The following properties are equivalent: 

(i) the only A-invariant closed subspaces of H are {0} and H ; 

(ii) the only Hermitian operators commuting with all A € A are scalars. 
A is then everywhere dense in L(H) with respect to the ultrastrong topology. 
IfH is finite-dimensional, then A = L(H). 


Since A is self-adjoint, the orthogonal complement of a closed invariant 
subspace is again invariant (n° 19, lemma 3), so that the corresponding pro- 
jection commutes with all A € A. This shows that (ii) ==> (i). 

If (i) holds — in which case the algebra A is said to be irreducible —, let 
HT be a Hermitian operator commuting with A. As was seen above, so do its 
spectral projections [apply arguments to A(H)], the corresponding subspaces 
are A-invariant, and hence are trivial. 

Finally, under the assumption of the Lemma, the von Neumann algebra A’ 
of operators commuting with A consisting only of scalars, A” = £(H). Von 
Neumann’s density theorem [n° 19, (vii)] then shows that A” is the closure of 
A with respect to the ultrastrong topology (but obviously not with respect 
to the topology defined by the norm if 1 is infinite-dimensional). If H is 
finite-dimensional, so is £(H), all topologies coincide, and A = L(H) since 
in finite dimensional, every subspace is closed, qed. 

Above all, this result is relevant for unitary representations of a locally 
compact group G. Such a representation is [n° 15, (ii)] an ordered pair (H,U) 
consisting of a Hilbert space H and of a linear representation x +> U(x) 
by unitary operators. As U(x)* = U(x)~! = U(a~?), the set A of linear 
combinations of U(2) is a self-adjoint algebra, and the A-invariant subspaces 
(closed or not...) are invariant under operators U(x) and conversely. If the 
representation (H,U) is irreducible, the only continuous linear operators in 
H commuting with all U(x) are scalars, and conversely. We will return to 
this point in more detail in § 8. 


(iv) Spectral decomposition of a normal operator. Let us study the classical 
case of a normal operator N and of the algebra A = A(N) generated by 
N and N*. The space A can be identified to the spectrum of N by x 
xy(NV) = ¢. All measures, functions, integrals introduced in the general case 


84 The traditional version of Schur’s lemma assumes that we are in dimension is 
finite and can be proved in an elementary manner. A less obvious version, which 
does not use Hilbert structures, applies to “semisimple” algebras, i.e. those in 
which every invariant subspace has an invariant complement. 
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are, therefore, defined on Sp(N) = X C C, and N is the operator M(y) for 
the function ¢. Then relation (17) becomes 


(22.18) NPN*? = / CPC¢dM(¢), or (NPa|N4y) = if CPCIdpiz y(C) - 
This result is due to von Neumann®? 
is usually written dE(¢). 

If H is Hermitian, its spectrum is real and contained in [my, My]. In the 
former version of spectral decomposition, everything was reduced to Stieltjes 
integrals over R using projections 


except for notation: what I write dM(¢) 


(22.19) E(\) = M ((mu,))) », AE Imax, My], 
and right-continuous and increasing functions 


which define the right Radon measures [z,, of the general theory. Formula 
(17) is traditionally written 


My 
(22.21) f(H) = / P(AAM (A) 

MH 
for every bounded Borel function f on [my, My]. It was only really used 
for f(A) = XA”. One had to interpret as a Stieltjes integral in which the 
“measure” of an interval is an orthogonal projective operator on H. For 
example, the measure of Ja, 6] C [mx, Muy] is E(B) — E(a), ie. M(la, 6) 
in the notation M(w) of section (iii), but that of Ja, 6[, ie. M(Ja, 6]), is 
defined by taking the limit of E(6’) — E(a) as 6’ < B tends to £, etc. These 
constructions contained many traps since no one, apart from F. Riesz, used 
the characterization of measures as linear functions. The GN theorem, that 
was beginning to be glimpsed in the case of the algebra A(H), was replaced 
by explicit arguments on operators. 

If U is unitary, then U* = U~!, whence |x(U)| = 1 for every character 
of A(U). So the spectrum of U is contained in the unit circle |¢| = 1; thus 
one can interpret M as the spectral measure on T and write the spectral 
decomposition of U as 


U” = f wraM(u), 
T 


which is far better®® than the formula 


85 Von Neumann considered operators that were not defined everywhere, a far more 
difficult case than the one considered here. His 1929 articles in Math. Annalen 
are perfect examples of axiomatic and abstract arguments and can still be read 
profitably. Those who criticize N. Bourbaki’s taste for “modern” mathematics 
could also target the “father of computer science” ... 

86 Since it can be generalized to unitary representations of all locally compact 
commutative groups. 
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27 : 
U” =| e”'dE(t) 
0 
that one still encounters. 


23 — Self-Adjoint Operators 


(i) Inverse of an injective Hermitian operator. Besides continuous Hermitian 
operators, there are non-continuous ones that are not defined everywhere. 
These are the self-adjoint operators; they are very important, in particular 
in the theory of partial differential equations. They are the subject (von 
Neumann) of a similar theory where the spectrum is an unbounded interval 
of R, hence on which the function A is not necessarily integrable with respect 
to all measures juz,7. It is easy to understand where they come from even 
without leaving an “abstract” framework — it suffices to study the inverse, 
not defined everywhere, of a Hermitian operator, which is what we are going 
to do —, but it is possible to come up with concrete examples: the operator in 
L?(R), which transforms every function f(t) into tf (t) is obviously not defined 
everywhere, though it is quite reasonable. Taking the Fourier transform one 
gets f + 27if’, an operator whose direct definition would be more tricky. 

So let us consider a Hermitian, and hence continuous, operator H on H, 
and set X = Sp(H) to be a compact subset of R. H is invertible if and only 
if the function \ has a continuous inverse in X, which means that 0 ¢ X 
since X is closed. Introducing the measures ju,,, of (22.21) then gives 


(23.1) (Et alo = le Xda ds 
(23.2) \|— | ee Pen Ad blige (A) 


for all x,y € H. But if 0 € X, there is no reason why the function \~! should 
be integrable, much less why it should be square-integrable, with respect to 
the spectral measures fz, of H, so that (1) and (2) need not be make sense. 

The difficulty comes from the fact that H may be neither injective, nor 
surjective.’ The first one is not serious. The subspace Ker(H) of x for which 
Hx =0, i.e. such that (z|Hy) = 0 for all y, is orthogonal to the subspace 
Im(H). The orthogonal complement of Ker(#), i.e. the closure of Im(#), 
is a closed H-invariant subspace on which H is injective. It is, therefore, 
reasonable, to work in this subspace, in other words, to suppose that H 
is injective, which is what we will do in what follows. The subspace D = 
Im(#) is then everywhere dense in 1. By the way, note that, by (22.14), the 
injectivity of H is equivalent to 


fx,c({O}) =O forall eEeH. 


87 Counterexample on H = L?(N): the operator defined by Hf(n) = 0 if n = 0, 
Hf(n) = f(n)/nifn #0. 
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In these conditions, H has an inverse 
H-':D=Im(H) —-H 


given by 


y=H'e v= Hy. 


This operator, which is not defined everywhere and even less continuous, has 
very remarkable properties. 


Theorem 34. Let H be an injective continuous Hermitian operator on a 
Hilbert space H and H~':D =Im(H) — H its inverse. 
(a) The elements x € D are characterized by the relation 


(23.3) [2 rditeaQ) < +20, 


(b) The following hold: 


H-'Hxe=2x forallxeH, 
HH-'x=«a forallxeD. 


(c) Every continuous operator A commuting with H maps D to D and 
satisfies 


AH"'c=H-'Ax forall reD. 


(d) The function A~! is integrable with respect to fix, for allx €D and 
y EH and 


dpty—12,y(A) = AW *dptey(A) ; 
(23.4) (A~t2ly) = fA *dpzy(A) force D and yEH. 


(e) The operator H~' is self-adjoint: 
(H~*aly) =(2|H~'y) for z,yeD, 


and the only y € H for which the map x ++ (H~'2\y) from D to H is 
continuous are the elements y € D. 

(f) For any Borel set w Cc X whose complement is a neighbourhood of 
0 in X, H(w) CD, H~' maps H(w) to H(w) and the restriction of H~+ to 
H(w) is continuous. 


Proof of (a). We begin by observe that, {0} being a null set with respect 
to all spectral measures of H, the functions \~! and \~? are defined almost 
everywhere with respect to these measures. Let us then consider a vector 
x = Ha’ € Im(#), whence 
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Aptx,« (A) = NM dpc! x! (A) 


by property (MS 4) of spectral measures [n° 22, (ii)]. As the function A~?\? 
is almost everywhere equal to 1, and hence integrable with respect to Uy’,.’, 
theorem 25 shows that A~? € L1(X; pz,x), proving the necessity of (3). 

Conversely, let us suppose that (3) holds and write B°(X) for the set 
of bounded Borel functions on X. It is everywhere dense in L?(X;fx,.). Let 
H(a) be the closure of the set of M(f)x in H, where f € B™(X). As 


|IM(Ael? = / If Pdits.« 


by (22.9), the map f +> M(f)a extends to an isomorphism j from 
L?(X3px,2) onto H(x). Relation M(fg)x = M(f)M(g)z, ie. j(fg) = 
M(f)j(g), which holds for all f,g € B®(X) by n° 22, (ii), shows that 7 
transforms the multiplication operator p++ fy in L?(X; z.2) — a continu- 
ous operator since f is bounded — into the operator M(f) in H(x). Therefore, 


Ife) = M(f)i(¥) 


for all f € B®(X) and all bounded and unbounded y € L?(X; pe,x)- 

This being so, let us suppose that x satisfies (3). The function y(A) = A~+ 
is in L?(X;pz,x), the function f(A) = \ is bounded and Borel on X and 
fie = 1 almost everywhere with respect to fiz... So M(f)j(y) = M(fey) = 
M(1) =«. Thus there exists x’ = j(y) such that « = M(f)a’ = Ha’, which 
is what needed to be proved. 

Proof of (b). Set theory, chap. I. 

Proof of (c). If A commutes with H and if x € D, then there exists x’ such 
that x = Ha’, whence Ax = Hy’ where y’ = Az’. As a result, Ax = y = Hy 
is in D and H-1 Agr = H-ly =y! = Av’ = AH“2. 

Proof of (d). For « € D, we once again consider the isomorphism j : 
L?(X3px,2) —> H(x) used in point (a) of the proof. As was then seen, 
x = j(y), where y(A) = A“! and x = Hz’, where x’ = j(f), f(A) =A. Then, 
for all y € H, 


Apt y(A) = dura y(A) = A.dpary(A)- 


This relation obviously continues to hold if the measures and the function A 
are replaced by their absolute values [n° 17, (ii)]. As A~+A = 1 is integrable 
with respect to |/12/,y|, theorem 25 shows that A~! is integrable with respect 
to |Ux,,| and that 


Ah dptey(A) = dpe y(A) ifs’ =H se, yeH, 


which is the first relation in (4). Integration of the function 1 with respect to 
these two measures leads to the second one. 
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Exchanging the roles of x and y and taking conjugate measures, the for- 
mula 


AT dhte,y(A) = Aftay (A) if y! = Holy, eS H 


is obtained in a similar way. Replacing y by y’ in the last but one formula, 
once again thanks to theorem 25, which is in fact almost trivial in this case, 


* A? dite y(A) = dite y (A) if 2 = Hote and y! = Hy. 
y y 


Proof of (e). These relations show that 


(H~*zly) a (a|H~*y) => [rds = [te gQ), 


which is clear and shows that, for all y € D, the linear functional « +> 
(H~'zly) is continuous on D. Conversely suppose that, for some y € H, 
there is a vector y’ such that (H~‘z|y) = (zly’) for all x € D, hence of the 
form « = Hz’. As Hx € Im(H) = D, x may be replaced by H« in this 
relation, whence (a|y) = (Hzly’) = (a|Hy’). Since D is everywhere dense, 
one deduces that y = Hy’ € D. 

Proof of (f). If the open set w avoids the neighbourhood of 0, then there 
is a number m > 0 such that |A| > m for all A € w. But for all ¢ € H(w), 
Lx,2(X —w) = 0 by (22.12). The bounded function A~? on X —w, is, therefore, 
integrable with respect to fz,2, whence x € D. Furthermore, 


|x|? = / dyiyete,4-12(0) = / A~2djtee(A) by (*) 
< m=? | dptee(A) = ma? 


Thus H~! is continuous on H(w). H~' maps H(w) to H(w) since H~!M(w) = 
M(w)H~", which is a very particular case of (c) for A = M(w), qed. 

Exercise 1. Let A be a GN algebra in H and y a real Borel function on the 
spectrum of A. Find the assumptions needed to define M(y) as a self-adjoint 
operator. 


(ii) Canonical extension of a positive symmetric operator. Point (e) of the 
statement implicitly defined the notion of a self-adjoint operator. To under- 
stand it, consider an everywhere dense subspace D of H and a symmetric 
map S : D —> H (I prefer keeping the term “Hermitian” for continuous 
operators), i.e. such that 


(Sazly) = (a|Sy) for z,yeD. 


For all y € D, the linear functional 7 + (Sly) is therefore continuous on 
D. But it may also be so for some y ¢ D. Let D* be the set of such y. It 
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is a subspace of H containing D, and associating to each y € D*, its unique 
vector y* such that (Sz|y) = (a|y*) for all x € D, we define a map 


S*:D* -H 


naturally called the adjoint map of S extending S. S' is said to be self-adjoint 
if D* = D, in which case S does not admit any other symmetric extension 
apart from itself. This is exactly what proposition (e) of the previous theorem 
says. 

When a symmetric operator S is not self-adjoint, it is not always possible 
to find a self-adjoint operator extending it. I will not address this problem 
in the general case,°* but in the far simpler and often used case, where 5 
is positive, i.e. satisfies (Sx|z) > 0 for all « € D. Then there is a canonical 
self-adjoint extension of S. It is a consequence of the next result: 


Theorem 35°°. Let H be a Hilbert space, D an everywhere dense subspace 
of H and S a positive symmetric operator defined on D. Set 


(23.5) m = inf (Sz|z) / (2x) . 


Ifm > 0, there is an injective, positive, continuous Hermitian operator H in 
H having norm < 1/m such that 

(a) HS =id on D, 

(b) H commutes with every unitary operator U such that®® 


U(D) CD, USx=SUa forallxreD. 


Ifm =0, then there is an injective, positive, continuous Hermitian operator 
H in having norm <1 such that HS =1—H onD and satisfying (b). 


The case m = 0 immediately reduces to the previous on by taking S' + 1. 
Hence, one may suppose m > 0 and even m = 1. 

The proof only uses the elementary results of n° 19. It consists in con- 
structing a second Hilbert space H’ and a bijection from H’ onto a subspace 
of H. There is a tendency to identify H’ with this subspace, but it is then 
impossible to know whether its vectors should be considered to be elements 
of H or of H’. This book not being aimed at experts, I will use somewhat 
clumsy but precise notation in order to avoid such possible confusion. 


88 See for example Dieudonné, XV.13 or any other treatise on Hilbert spaces. 

8° According to Béla v. Sz. Nagy, Spektraldarstellung Linearer Transformationen 
des Hilbertschen Raumes (Ergebnisse der Math., Bd. R, 1942, p. 35-36), a book 
that in earlier times was like the Bible for me, this theorem was first proved by 
von Neumann (1929), then by M. H. Stone in his book on Hilbert spaces (1932). 
The proof in the text is due to Hans Freudenthal (1936) and simplifies a proof 
by Kurt Friedrichs, Spektraltheorie halbbeschrnkter Operatoren (Math. Annalen, 
109 and 110, 1934). French PDE tradition attributes it to Gelfand, Lions, Kato, 
etc. Looking at it this way, it could also be attributed to me since I was using it 
in 1946-1947... 

9° Tf these properties hold, U will be said to commute with S. 
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(i) Let us consider the positive Hermitian form 


(23.6) (ly) = (Sey) 


on D. It enables us [n° 19, (i)] to construct a Hilbert space H’, whose norm 
will be written ||z||’, and a canonical map 


r:D—>H' 
from D onto an everywhere dense subspace D’ of H’. Hence 
(23.6’) (x(x) |m(y))’ =(Saly) for z,yED 
and, by assumption (5) for m = 1, 


/ 


lel] <||n(w)||" for ee D. 


Thus the map 7 is injective. Furthermore, the set of 7(x) being everywhere 
dense in H’, there is a unique continuous linear map 


J:H —H, 
having norm < 1, such that 
J[r(x)) =a forallaeD. 


(6’) then shows that (u|v)’ = (SJu|Jv) for u,v € D’. But for given u € D’, 
both sides are continuous functions of v € H’ since J is continuous. Hence, 
as D’ is dense in H’, 


(ulv)’ =(SJu|Jv) forweD’, veH’. 


In particular, Ju = 0 implies (ulv)’ = 0 for all u € D’ and sov = 0. Asa 
result, J is injective. 
(ii) Since J : H’ —> H is continuous, it has an adjoint [n° 19, (iii)] 


J :HOH 
having norm < 1 like J. By definition, 
(J*alv)’ = (a|Jv) fora eH and vEeH', 


which shows that J*x = 0 implies (z|Jv) = 0 for all v € H’. Thus x = 0 since 
J(H’) is everywhere dense in H. The operator J* is, therefore, injective (as 
is the adjoint of any operator whose image is everywhere dense). 

Replacing x by Sz and v by x(y) for y € D in the previous relation, for 
ally € D, 


(J*Sa|m(y))’ = (Sal Jn(y)) = (Saly) = (n(z)|r(y))' 
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So the set of 7(y) being everywhere dense in H’, J*Sx = x(x) for all x € D. 
Hence 


JJ*Sx=«x foralreD. 
The operator 
H=JF:HOH, 


is continuous, has injective norm < 1 like J and J* and satisfies condition 
(a) of the statement. H is positive Hermitian like any operator of the form 
AA*. 

(iii) Let us consider a unitary operator U on H which preserves D and 
commutes with S on D. Then, for z,y € D, 


(x(Ux)|n(Uy))’ = (SUa|Uy) = (USx|Uy) = (Szly) = (x(x)|m(y))’ - 
As a result, there is a unitary operator U’ on H’ such that 
roU=U'on. 


By definition of J, it is written J(U’y) = UJ(y) for all y € D’. The same is 
true (continuity) for y € H’, whence JU’ = UJ. This implies 


aU andso UF Se 


since U* = U~! for all unitary operators. The fact that U commutes with H 
is then obvious. qed. 


Let us now show how to deduce from the theorem a canonical self-adjoint 
extension of S, i.e. commuting with every unitary operator commuting with 
S. 

If m > 0, it is the inverse H~! of H defined in theorem 34. It is positive 
like H, defined on the image of H, hence on D, and as H~'H = 1, clearly 
H-'g = Sa for all z € D. 

If m = 0, we start from the relation HS = 1— H and write the spectral 
decomposition 


a= f rama, 1-H= f a—»am) 


of H, where X C [0,1] is the spectrum of H. For x € D, HS‘ is the image of 
H, hence also x = HSx + Hx. So H~! may be applied, the outcome being 
the relation 


(23.7) Sce=H 2-2. 
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As a result, the self-adjoint operator 


H*-—1= | (A! — 1) dM(A) 
xX 


1 


extends S. It is defined on the same vectors as H~*, i.e. on x € H such that 


| rN djte.a(A) < +00. 
xX 


Exercise 2. Suppose H = L?(R), D = L(R) and Sf = yf, where y > 1 is 
such that y? is locally integrable. Set out explicitly H’, J, J* and H in this 
case. Compute the self-adjoint extension of S assuming only y > 0. 


Corollary 1. Let S be a non-trivial positive symmetric operator defined on 
an everywhere dense subspace D of H. There is a non-zero positive, contin- 
uous Hermitian operator S’ on D such that: 

(a) (S’a|x) < (Sa|x) for alla € D, 

(b) S” commutes with all unitary operator commuting with S. 


Indeed, let us choose numbers a and b such that 0 < a < 6b < 1 and 
consider the spectral projection 


M(a,8)=M ((a,8) = fac) 
[a,] 
of the operator H of the theorem. Since 
(H~*— 1) M(a,b) = / (A7* — 1) dM()) 
[a,b] 


with a > 0, this operator can be extended by a continuous operator S’ on 
H, which is obviously Hermitian positive and commutes with every unitary 
operator commuting with S. 

5S” satisfies condition (b) — obvious — and condition (a) since 


(Sala) = Jia.) (A-1 = 1) dpee(A) for cE H, 
(Sz|x) = Sioa (A711) dpz,e(A) for ve D. 


On the other hand, the kernel of S’ is the set of x such that 
\|S’a||° =) (At = i Cit N=). 
[a,b] 


Since 0 < a < b < 1, this means that [a,b] is a null set with respect to 
[ty,2. If, for given x € D, this is the case for all a and 6b, the measure juz, is 
concentrated on the subset {0,1} of the spectrum of H. As H is injective, 0 is 
not an eigenvalue of H and as a result, fuz,2({0}) = 0. Thus pz, ((0, 1[) = 0, 
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whence Hx = x. Therefore, S’ = 0 may hold on D for all choices of a and b 
only if H = 1 on D. But then relation HS = 1-H shows that HS = 0, and 
so S = 0 since H is injective, qed. 


Corollary 2 (Schur’s lemma IT). Let (H,U) be an irreducible unitary rep- 
resentation of some leg G and S a self-adjoint or positive symmetric operator 
defined on an everywhere dense subspace D of H. Suppose D is U(x)-invariant 
and that U(x)S = SU(x) in D. Then S is a scalar. 


If S is positive symmetric, the canonical self-adjoint extension S com- 
mutes with all U(x), hence so do its spectral projections, which being con- 
tinuous, are scalars, proving the result in this case. 

If S is not positive, but self-adjoint, consider S +7. As in n° 19, (iv), 


Thus S +7 is injective. If lim(S + 2)x, = v exists for x, € D, the sequence 
(an) satisfies Cauchy’s criterion, and so converges to some xz € H. Thus 
lim Sz, = y = v — 12 exists. But then, for all u € D, 


(S + i)al|? = ||Sal|? + |lal|? for all « € D; 


(Sulz) = lim (Sula,) = lim (u|Sz,) = (uly) - 


Hence x € D and Sx = y since S is self-adjoint. The limit y + za of the 
(S +i), is, therefore, (S+7)x, and so the subspace Im(S +7) is closed. Any 
vector y orthogonal to all (S + 7)x satisfies (Sz|y) = —7(a|y) for all x € D, 
and thus is in D since S is self-adjoint, and since Sy = iy then holds, it 
follows that y = 0. Finally, Im(S +2) = H. So S +7 is a bijection from D 
onto H, its inverse being also continuous and having norm < 1. (See exercise 
below for the rest. .). 

That being so, if S commutes with all U(a), so does S$ +7, and hence also 
its inverse, which, being continuous, is a scalar by Schur’s lemma I [n° 22, 
(iii), end]. Thus so is S, qed. 

Exercise 3 (von Neumann, 1930). Let H be a self-adjoint operator defined 
on a subspace D of H. We have just seen that H +i: D — H is bijective 
and that the inverse map (H +i)~! from H to D is a continuous operator. 
(a) Show that the map U : «+> (H —i)(H +1)7!a from H to H is unitary 
and that 


1-U=2(H+i)'. 
(b) Let 


ur = f wr.amtu) 


be the spectral decomposition of U. Show that the spectral manifold H({1}) 
is trivial. Denote by dM (A) the image of dM(u) under the homeomorphism 


ur > A=i(1+u)/(1—w) 
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from T — {1} onto R. Show that this gives a non-degenerate spectral measure 
on R such that 


Ws / y.dM(d), 


the elements x € D being characterized by \?dpiy,2(A) < +00. 


24 — Continuous Sum Decomposition 


(i) Virtual eigenvectors. In finite dimension, any commutative GN algebra A 
is diagonalizable: if, for every character x of A, H(x) denotes the subspace 
of x € H such that 


(*) Tx=y(T)« forall TEA, 


then the subspaces H() are pairwise orthogonal and H is the direct sum of 
these subspaces. 

This is obviously no longer the case in infinite dimension — relation (*) 
usually has no solution « 4 0 -, but we will see that, nonetheless, there is 
an analogous result provided “eigenvectors” not belonging to H are allowed, 
“direct integrals” or “continuous sums” are substituted to standard direct 
sums and H is assumed to be separable. 

So let A C L(H) be a commutative GN algebra X its spectrum and let us 
suppose that H is separable. Once and for all, we choose linearly independent 
vectors a, whose linear combinations are everywhere dense in H, and denote 
by Ln the spectral measure jiz,, for £ = a,. There are numbers a, > 0 such 
that 


S > anllan ||? = S> anpin(X) < +00, 


which enables us to define the measure pp = )> Anbn by U(f) = 5 antn(f). 
This is a trivial case of a continuous sum of measures. The measures j4, are 
clearly bounded above by pu, up to some factors, so that if w C X is a Borel 
and p-null set, then 


|| M(w)an||? = (M(w)an|an) = Un(w) =0. 


So M(w) = 0, and hence (M(w)a|a) = 0 as well for all x. As a result (LN), 
all measures tz, are absolutely continuous with respect to and relations 
M(w) = 0 and p(w) = 0 are equivalent. Any measure 4 > 0 with this 
property is suitable for the following arguments. 

Let us set 


Pay € I1(X, ») = L since Hx,y is bounded. Functions p,;,, clearly have the 
same formal properties as the measures jiz,,, for example 
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(24.1) Py,c(X) = Pey(X), Pz,2(x) >= 0, 


but these relations hold almost everywhere only with respect to yw. If they 
were true for all y (which is the case if A is finite), the function (x,y) 6 
Pz,y(x) would be a positive Hermitian form on H with respect to all y. 
Taking the quotient by the subspace of x such that pz,«(x) = 0 and then its 
completion (n° 19, beginning) would lead to a Hilbert space H(y), with a 
canonical map from H to H(x) which it would be natural to write 


£t— x(x); 


Pey(X) = (z(x) yx) 


would then hold and as a result, 


(M(¢)zl|y) = [eco (x(x)|y(x)) du(x) 


for all x,y and bounded measurable functions y on X. Thus the space H 
would seemingly be the “continuous sum” or “direct integral” of spaces 
H(x) with respect to yz, operators M(y) likely mapping a function x(x) to 
the function y(x)#(x) with, in particular, the relation 


y=Tx y(x) = x(T)2(x) for all Te A 


analogous to (*)... 

This far too naive argument needs to be corrected using the null-set elim- 
ination machinery. The method, due to von Neumann,”! consists in coming 
back to the initial pseudo-basis (a,,) and in assuming x = a;, y = a, in (1). 
The corresponding functions pz, = pi,; being chosen once and for all, rela- 
tions (1) hold outside null sets N;;, and thus do so simultaneously outside a 
null set. The latter can even be chosen in such a way that for all y € X —N, 
the matrix (p;,;(x)) is positive Hermitian, in other words satisfies 


(24.2) Yo Pig OD&E; > 0 


I<i,j<n 


for all integers n and coefficients £1,...,& € C. Indeed, setting « = > &ai, 
the left hand side of (2) is just pz.x(x), up to a null set, so that (2) holds 
outside a null set Nz. But to check (2) for given n and x and arbitrary 
coefficients €;, by continuity, it suffices to do so when the set of €; is every- 
where dense in C, hence in a countably infinite number of cases. By making 
n vary, one finally gets a countably infinite number of sets Nz. to be elimi- 
nated. Taking their union with the null set introduced above gives a null set 
N outside which formulas (1) and (2) hold for all ordered pairs (#4, 7). 


°! On rings of operators. Reduction theory (Annals of Math., 50, 1949). 
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Let us denote by A the (everywhere dense but not closed) vector subspace 
of H generated by all ay. If x,y € A, then 


(24.3) r= S- EnQn, Y= S- Nn@n 


with well-determined coefficients since the a,, have been assumed to be lin- 
early independent. Hence 


(24.4) Pay (X) = S- £7) ;Pi,3 (x) ae. 


The functions p;,, being determined up to null sets, for ordered pairs of form 
(3), (4) can be taken to define pz.y(x) for all y ¢ X —N. For these x, by (2), 
the map (x,y) > pz,y(x) is then a positive Hermitian form on the subspace A 
of H consisting of vectors (3). Replacing all p;,;(x) by 0 on N, (4) may even 
be assumed to hold for all x. 

Arguments similar to those used at the beginning of n° 19 lead to a 
genuine Hilbert space (x) and to a linear map x +> x(x) from A onto an 
everywhere dense subspace of H(x), a map for which, for all y, 


(x(x)|yOX)) = Pey(x) for al z ye A. 
The left hand side is a ji-integrable function. Since 
(M(p)aly) = f eCddiegx) = f C0) Palddel) 
= / 2(X)Px,y(x)-du(X) » 


the formula 


(24.5) (M(y)aly) = [oo (x(x)|y(x)) du(x) 


becomes legitimate for x,y € A and for every bounded Borel function y. In 
particular it shows that 


(24.6) let? = f leo? dua). 


\|z|| is thus the norm Np, of the function ||a()||, which is in L?(X, 2) since 
22 = Peve(x) is in E1(X, p). 

Let us now show that a function x(y) € H(y), defined up to a null set, 
can be associated to each « € in such a way that (5) continues to hold for 
x,y © H. We start by remarking that, for all x € H, there exist x, € A such 


that 
La oy & S-|l2nl| < +00. 
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Since the second relation can be written 


¥(fteatalduty) <0 


theorem 6 of n° 4 applied for p = 2 to functions ||x,,(y)|| shows that the series 
> ||an(v)|| converges ae. and in L?(X; jz) to a square integrable function F'(y) 
with respect to 4. Thus the series }* xn (y) converges ae. to some z(y) € H(x) 
satisfying ||z(x)|| < >> |lenQx)|| = F(x) ae. Setting sp = a1 +...+ ap € A, 
x(x) = lim p(x) ae., the functions ||s,(x)|| being, moreover, in L?(X;,) and 
dominated by F’. Therefore, the the function ||x()|| is square integrable and 
its norm N2 is bounded above by the sum of norms No of functions ||an(x)|l- 
Replacing x by x — s,, this shows that 


tim f I2(x) = spo)? du (x) =0. 


It follows that the function y +> x(x) only depends on the vector x € H, 
up to a null set. Indeed, if = > z/, is another way of approximating x by 
x}, € A, the series }> x, — x/, converges to 0. The previous result then shows 
that 


tin [ |[sp(0) — 89(0)||? aul€) = 0. 


Hence a partial sequence tending ae. to 0 can be extracted from the sequence 
of functions ||sp)(x) — s;,(x)|], and as anyhow the complete sequences s,(x) 
and s‘,(x) converge ae., this is sufficient to prove that 


YS) 2(x) = - 2h(X) pp 


as expected. 

This being settled, let us prove (5) for x,y € H. To this end, we choose 
sequences s, (resp. t,) in A satisfying the following conditions: (a) they 
converge to x (resp. y) in H, (b) s(x) (resp. tn(x)) converges to x(x) (resp. 
y(x)) in H(x) for almost all y, (c) the functions ||s,(x)|| (resp. ||tn(v)||) are 
dominated by functions F (resp. G) in L?(X;). Then, by (5) applied to A 
and the continuity of M(y), 


(M()ely) = lim (M(¢)sp]|tp) = lim / 2x) (Sp(3) [to(o) dulx) - 


As the functions (sp(x)|tp(v)) are (Cauchy-Schwarz) dominated by FG € 
L'(X; 1) and converge ae. to (x(x)|y(x)), Lebesgue’s theorem immediately 
least to (5) for x,y and even (replace y by yw) to 


(24.7) (M(v)a|M()y) = / v(x) BOO (a(x) yo) du) 
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for all bounded and measurable y and w. 
Let us now show that 


(24.8) y = M(p)x => y(x) = o(x)2(x) PP. 


and that in particular 


y= Tr => y(x) = x(T)2(x) ae. 


for all T € A. This result is essential if we want the x() to resemble common 
eigenvectors of all T € A. Indeed, by (7), for y= M(y)z, 


0=|M(y ye if = (M(p)x|M()x) — 2Re (M(y)aly) + (uly) 
= f lee? HeGoll? dutx )=2Re f (x) (c ()) due(x) 


+f Iycol? aubo) 
= f aux) [ied2O0I? = 2Re (eOd20)l¥00) + IlGOI? 


The expression under the f{ sign is just ||~(x)x(x) — y(x)||?, whence 


J lieo200 ~ vod dug =0 


and (8). 
Finally the functions x(y) thus obtained need to be characterized. The 
following two conditions answer the question: 
: 8) the function (x(x)|y(x)) is measurable for all y € A, 
b) f llxoo|l2du* (x) < +00. 
ee suppose that these two conditions hold. The function ||#(y)]|] is then 
measurable since 


IlzQ)Il = sup |(2(x x) / llyood 


provided y € H is made to vary in such a way that the set of y(x) is ev- 
erywhere dense in H(y). For this to be the case, it suffices to chose linear 
combinations of a, with “rational complex” coefficients. The set of these y 
being countable, condition (a) immediately implies that the function ||x()|| 
is measurable. Condition (b) then shows that it is in L?(X; 1). 

For two functions x(x) and y(x) satisfying (a), the inner product (a(x) |y(v)) 
is measurable because of the age-old identity 


4B(z,y) = Ble+y,r+y)— 


valid for all Hermitian forms. If x and y also satisfy (b), the function 
(x(x) |y(v)) is integrable (Cauchy-Schwarz). By (5) applied to y € H, 
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“< f ieco IP aul ). Fluo VI dyulxe) 
= Ilyll? / leOOIP dbo. 


Hence there exists (n° 19, theorem 31) x’ € H such that 


| (at Need) 


(2'ly) = | (eb)lwOd) dud for all ye 2, 


J (200 =2'GoluGd) dub) =0 for all ye H. 


By (7), replacing y by M(y)y multiplies y(y) by v(x) , whence 


[eo (x(x) — 2"(x) yx) du(x) = 0 


for all y € H. As ¢ is arbitrary, (x(x) — x’(x)|y(x)) = 0 ae.. Applying this 
result to the vectors y = a,, by eliminating the null sets, we conclude that 
x(x) = 2'(x) ae., which completes the construction. 

In conclusion, starting from the given algebra A in H, it is possible to 
find a measure 4 on X and a decomposition of H into continuous sums 
(with respect to 41) of Hilbert spaces H(x) so that all T € A multiply the 
“components” x(x) for all « € H in H(x) by T(x), the M(y) multiplying 
them more generally by v(x). This is the perfect analogue of a simultaneous 
diagonalization, and like in finite dimension, this is the result that should 
be called the spectral decomposition of A. The number dim #(x), defined 
almost everywhere, plays the role of the “multiplicity” of the “common 
eigenvalue” y of Tc A. 

The difference with the case of finite-dimensional spaces is the availability 
of a wide choice of measures yz. The only canonical aspect of the construction 
is the class of yw in the sense of Lebesgue-Nikodym. Changing 4 would modify 
H(x) only almost everywhere and would multiply x() by scalars independent 
of x. This is the best that one can hope for in this abstract and general 
context. 

In concrete examples, 1 is often a function space of L? type. Our hope 
is then to be able to consider the (x) as spaces of eigenfunctions (or distri- 
butions) of operators M(y). This requires methods from analysis far beyond 
the framework of integration theory. 


(ii) Continuous sums of Hilbert spaces. In all of the above, we have talked 
of continuous sums or of direct integrals of Hilbert spaces as physicists do, i.e. 
without mathematically defining this notion. We can now do so, and I will 
content myself with it in order not to lengthen the presentation excessively. 
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For this, let us take a locally compact space®? T’, a measure ps > 0 on T 
and, for all t € T, a Hilbert space H(t). Direct sums of trivial spaces being of 
little interest, it is reasonable to suppose that H(t) 4 {0} almost everywhere. 
For the same reason, it is reasonable to suppose that the support?? of y is all 
of T. 

As “measurable” or “square integrable” functions x(t) with values in 
H(t) need to be defined, which on the fact of it has no meaning if the spaces 
H(t) are unrelated to each other, suppose given in advance a family A of 
functions x(t) with values in H(t), defined up to null sets, and satisfying the 
following conditions: 


(VN 1) ax+ Bye Aforalla,ye Aanda,fPeEC; 

(VN 2) the function («x(t)|y(¢)) is measurable for all x,y € A; 

(VN 3) there is a countable set D C A such that, for almost all t, the 
set of x(t), « € D, is everywhere dense in H(t). 


It is possible to dispense with countability in (VN 3) by having recourse to 
Lusin’s theorem and to equimeasurable families, but this would needlessly 
complicate matters. 

These assumptions enable us to define functions x(t) that will be referred 
to as measurable: they are required to satisfy the condition that (a(t)|y(t)) 
is measurable for all y € D; by (VN 2), this is the case for all a € A. If 
x(t) is measurable, then clearly so is y(t)a(t) for any bounded measurable 
scalar function y and any limit-ae. of measurable functions is measurable. If 
x(t) is measurable, then so is ||a(t)|| by (VN 2) and (VN 3). If x and y are 
measurable, then so is the function 


(x(t)|y(t)) = |l2(¢) + y@)IF /4-..-. 


A function x(t) will be said to be square integrable if it is measurable and if 


(24.9) [leo au < +00. 


The existence of such functions is immediate: choose a measurable function 
x(t) which is not zero ae., replace x(t) by x(t)/||x(t)|| if a(t) A O and by 
0 if a(t) = 0, and finally multiply the result by an arbitrary function y € 
L?(T; 1). 

This said, the continuous sum H of H(t) with respect to py is, by definition, 
the set of classes of square integrable functions x(t), equipped with the inner 
product 


°2 In the chapter of his book on von Neumann algebras dedicated to this 
topic, Dixmier even considers the case of an “abstract” measure because the 
“spectrum” of a non-commutative GN algebra, equipped with its natural topol- 
ogy, is not always locally compact in a strict sense. 

°3 Open sets such that (U) = 0 that form an increasing filtering family, their union 
also being a null set. The support of ps is its complement. Saying that the support 
of is T means that 0 is the only function f € L+(T) such that p(f) = 0. 
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(ly) = / (x(t)|y(t)) dutt).. 


By (8) and Cauchy-Schwarz, the integral converges for x and y. One can then 
associate a continuous operator M(y) on H defined by 


y=M(y)e if y(t) = v)a(t) 


to any y € L©(T;). Formulas (22.4) of spectral theory continue to hold 
here: || M(y)|| < || fll, where this is the norm of f in L°(T; 1), and in fact 


(24.10) IM(Y)Il = IF lloo 


as is easily seen. 

By confining ourselves to continuous y on the Alexandrov compactifi- 
cation of T, we thus get a self-adjoint algebra A in H containing the unit 
operator. As the support of jz is assumed to be T,, the norm N. of such a func- 
tion is the same as its uniform norm ||/f||. Then because of ||1/(y)|| = ||y|l, 
A is closed in the Banach space £(#), and so is a commutative GN algebra. 
As was seen at the end of n° 12, its spectrum is the compactification of T. If 
T is compact, a case it can easily be reduced to, we thus clearly recover the 
spectral decomposition of A in the sense defined above. Hence decomposing 
a Hilbert space H into a continuous sum of spaces H(t) reduces to choosing a 
separable GN algebra A in H and to carrying out the spectral decomposition. 
For example, A may be assumed to be generated by a Hermitian operator 
H, in which case T is the spectrum of H, which acts on each H(A) as the 
scalar 4, like in finite dimension. . . 

Exercise 1. Show that an equivalent definition®4 of direct integrals would 
consist in taking T, 4, separable Hilbert spaces H and H(t), t € T, and in 
assuming that a class of functions x(t) € H(t) satisfying the following con- 
ditions is associated to each x € H: (i) linearity, (ii) (7(£)|y(t)) is integrable 
for all x,y € H and (zly) = f(x(t)ly(t))du(t), (iii) every function x(t) such 
that (x(t)|y(¢)) is integrable for all y € H corresponds to some x € H. 

Finally, let us give some indications about “decomposable” operators in 
a direct integral. Let A(t) be a function whose values are continuous linear 
operators in H(t). Assuming the functions A(t)a(t) to be measurable for all 
x € D and the function ||A(¢)|| to be bounded, there is a unique continuous 
operator A on H such that 


y = Ax y(t) = A(t)x(t) pp. 


“Decomposable” operators obtained thereby commute with all M(y). Con- 
versely, any operator A on H commuting with all M(y) is decomposable. On 
the other hand, suppose given a von Neumann algebra A in H. For almost all 
t, there is a von Neumann algebra A(t) in H(t) such that the relation A € A 


°4 See for example Richard Kadison and John R. Ringrose, Fundamentals of the 
Theory of Operator Algebras (AMS, 4 vol., 1999-2001), chap. 14. 
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is equivalent to A(t) € A(t) ae. if and only if the algebra Z of operators 
M(y) is contained in the centre of A. This is also true for the commutator 
algebra A’, and A’(t) = A(t)’ ae. Etc... 


(iii) The L? space of the integral of a measure. At the end of n° 17, we 
mentioned the following question: given an integral of measures 


v= | todX(e), 


under what condition is the L? space of v the continuous sum (with respect 
to A) of L? spaces for j1,? As we saw, the answer is simple if there are finitely 
many [tz: they must be pairwise disjoint. Let us now consider the general case 
keeping the notation of n° 14 and giving its precise meaning to the notion of 
“continuous sum”. 

(a) To begin with, it is prudent to suppose X and Z to be Polish. L?(Z; v) 
and H(z) = L?(Z; ,) are then separable. For almost all x, each f € L?(Z;v) 
is square integrable with respect to 4 (generalized LF), hence defines a vector 
fe € H(z). For f,g € L?(Z;v), the function (f,|g.) is integrable and 


(24.11) (fla) = i, (felge) dX(e). 


The set A of functions 7 > f,, f € L(Z) obviously satisfies axioms (VN 1), 
(VN 2). To check (VN 3), we choose a countable subset D of L(Z) such that 
every f € L(Z) is the uniform limit of functions f,, € D vanishing outside a 
fixed compact set. The classes of f € D in the L? space of all measures on Z 
are then everywhere dense, so that (VN 3) is satisfied by choosing functions 
xt f, for f € D. 

This enables us to define measurable functions with values in H(x) and 
hence the direct integral H of H(x). By (10), it is clear that, for any 
f € L?(Z;v), the associated function x + f, is in H and that this gives 
an isomorphism from L?(Z;v) onto a closed subspace Ho of H, but not nec- 
essarily onto H as already seen in n° 17. 

(b) Let us now show that Ho = H if and only if, for all f € L?(Z;v) and 
all bounded measurable scalar function ¢, there exists g € L?(Z;v) such that 


Gz = p(x) fx ae. for r. 


This means that the subspace L?(Z;v) = Ho of H must be stable un- 
der operators M(p). This condition is clearly necessary.°° To show that it 


°5 Tt is not always easy to check in concrete examples, in particular in extensions 
of Fourier transforms to semisimple groups, when it is not immediately obvious 
what the operators M(y) are and, as far as I know, no author who has dealt with 
the topic has given them explicitly. If this is really the case, this means that the 
(generalized Plancherel) decomposition of the regular representation of the group 
into “continuous sums” of irreducible representations has not yet been proved 
in the strict sense of the word, even for groups such as S'L2(R) (Harish-Chandra) 
or SL2(C) (Gelfand and Neumark)... 
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is sufficient, let ++ h, be an element of H orthogonal to Ho. By as- 
sumption, [(fr|ha)d\(x) = 0 for all f € Ho, hence also by assumption, 
J 9(%)(fx|h2)dA(x) = 0 for all bounded measurable y, and so h, = 0 ae. 
follows from (VN 3), qed. 

(c) We suppose that this condition holds and let X = AUB be a partition 
of X into two measurable sets A and B = X — A. Setting 


(24.12) va f texale)dn), VB = f toxola)an), 


v =va+vgp and the space L? of v4 (resp. vg) can be identified with the 
subspace of f € L?(Z;v) such that f, = 0 outside A (resp. B). However, 
the operator E4 = M(ya) on a continuous sum of Hilbert spaces is, by 
definition, the orthogonal projection onto the subspaces of vectors whose 
“components” are trivial outside A. As a result, 4 (resp. Eg) maps L*(Z; v) 
onto L?(Z;v,4) (resp. L?(Z;vg)). The subspace Ho of H is, therefore, stable 
under all operators E'4 and 


(24.13) L?(Z;v) = L?(Z;v4)@ L?(Z;vx_,), direct sum. 


Conversely, this condition ensures that L?(Z;v) = H. Indeed, it shows that 
L?(Z;v) is stable under M(vy) for any function y which is the linear com- 
bination of characteristic functions of measurable sets. Now, any bounded 
measurable function y is the uniform limit of a sequence of functions yp, 
of the previous type. As ||M(y)|| = ||vll.o for all bounded y, the functions 
M(pn)f € Ho converge in H to g = M(y)f. Thus g € Ho since Ho is closed. 
Condition (11), therefore, holds, whence L?(Z;v) =H by section (b) above. 

(d) Hence, L?(Z;v) =H if and only if (13) holds for any measure subset 
A of X. By n° 17, (iii), this means that 


(24.14) va and vx_,_ are disjoint 


for all A. The generalization to finite or countable partitions of X is obvious. 
It would be tempting to conjecture that this condition means that the mea- 
sures fl, and py, are disjoint whenever z # y. As will be shown in point (e) 
below, this is almost necessary, but the following counterexample shows that 
it is not sufficient. 

Exercise 2. Take X = Z = [0,1]. Let m denote the Lebesgue measure and 
€, the Dirac measure at x. Choose tw, = m for x = 0, Ux = €g for x > 0 and 
\ =m + €9. Show that, for A = {0} and B = ]0, 1], v4 =vzg =m. 

(e) In the general case, we let D denote the diagonal of the product X x X 
and assume that (13) holds. Then the measures pz, and [ty are disjoint for 
almost all (x,y) € X x X — D. The condition is thus necessary, but not 
sufficient. 

If v4 and vg are disjoint, then indeed there is (LN) a Borel set N C Z 
such that 
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va(Z—N)=0, ve(N)=0. 
By theorem 22, or (13.7) applied to measures v4 and vp, 


(24.13’) fy(Z—N)=0 forall xe A—Na, 
Hy(N)=0 forall ye B-N, 


where N4 and N%, are A-null sets. However, the set Nap of (x,y) € 
A x B such that the measures jz; and py are not disjoint is contained in 
Ax N!,U Na x B, a null set by (LF). Hence 


rx \(Na,B) =0. 


Since X x X — D is open in the product, it can be covered by open sets of 
the form U x V, where U and V are open in X and disjoint (Hausdorff). As 
X x X — D is locally compact and separable, and so the union of a sequence 
of compact sets, X x X — D is the union of sets U, x V, where, for each n, 
U,, are V;, are disjoint open sets. As 


Un X Vn C Un x (X —Un), 


relation (13) for A = U, and B = X—U,, shows that the set of (x, y) € Un x Vn 
such that pz and jy are not disjoint is a null set with respect to \ x A, proving 
the result. This is the best that can be said since one can modify the measures 
Lz, on a null set without changing v. 

Exercise 8. Let G be a unimodular Icg acting properly on a space Z, v 
a G-invariant measure on Z and X the quotient measure [n° 15, (iv)]. Hence 
v = f u,dA(x) using notation (15.13). Show that the condition of point (d) 
holds in this case. 

Exercise 4. (a) Let yu’ and ww” be two positive measures on a locally com- 
pact space Z. For any f € L+(Z), set 


i / i 
BY get 5 Trl 
hsf 

Show that pu is the restriction to L,(Z) of a measure on Z and that a positive 
measure is bounded above by py’ and yw” if and only if it is bounded above 
by yw. Set uw = inf(y’, uw’). (b) Suppose Z to be separable. Show that in the 
previous formula, it is enough to make h vary in a countable set independent 
of ys’ and yw” . (c) Let X and Z be two separable locally compact sets, \ a 
positive measure on X and (w!,), (uw) two families of positive measures on 
Z measurable with respect to A. Set 2, = inf(u),, my) for x,y € X. Show 
that, for all f €¢ L(X x X), the function (2, y) + usz,,(f) is measurable with 
respect to A x and that the set of (x,y) such that uz,, = 0 is measurable. 
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25 — Convolution Product on a Lcg 


To illustrate the general statements in integration theory and in particular 
the Lebesgue-Fubini theorem, let us begin by show how they can be used to 
extend to a lcg G the convolution product which has already been encountered 
in Chap. VII (Fourier series) and even in Chap. II, n° 18, example 3 for dis- 
crete groups. In all that follows, unless otherwise mentioned, “measurable” 
or “integrable” over G or over G x G will be with respect to a left invariant 
measure dz or to the product measure dxdy. To simplify, G may be assumed 
to be countable at infinity, though this assumption is not needed for any of 
the results. 


(i) Convolutions and representations. Let us start by proving a reassuring 
result: 


Lemma 1. Let f be a function on G with values in a topological space. The 
function (x,y) > f(xy) is measurable on Gx G if and only if f is measurable 
on G. 


Indeed, the most primitive version of LF shows that l’on a 


i | Headey = | | F (2,47?) A(y)dedy = , Aly)dy i F (@,y71) de 


= [ Atyyay fF (oyy) Ay) ae 


= i | Pag) day 


for all F € L(G x G), where the function A is that of n° 15, (i). Therefore, 
the measure dxdy is invariant under the homeomorphism (2, y) > (xy, y~*), 
so that the latter transforms every measurable function into a measurable 
function. As it transforms f(xy) into f(x), (x,y) + f(xy) is measurable if 
and only if so is (x,y) > f(x), ie. if and only if so is f, qed. 

The most natural way of introducing the convolution product is to start 
with a representation (H,U) of G [n° 15, (ii)], which need not be unitary 
nor bounded. The function U(x)a being continuous for all a € H and hence 
bounded on all compact sets K C G, (n° 15) 


(25.1) sup ||U(x)|| < +00. 
cek 


The function ||U(a)|| = p(x) is, moreover, Isc since it is the upper envelope of 
the continuous functions 7 4 ||U(x)al|/|lal]. So, for any compact set K C G, 
there is point where p is minimum [Chap. V, n° 10, property (vi)]. This 
minimum cannot be zero since the functions U(x) are invertible, whence 
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(25.1’) inf |U()II >0. 


This being settled, the continuous function « + U(a)a can be integrated (n° 
4) with respect to any complex measure py such that 


(25.2) / A@dluley = Xoo. 
The vector 
U(wja= f U(wjadu(z) 


thus obtained depends linearly on a, and 


| fu@adu(e)| < [ W7@all aal(e) 


by (17.4), also holds for vector-valued functions. The operator U(j1) is thus 
continuous and 


WwIh s f olz)alnl(a). 
In particular, if the functions U(a) are isometric, in which case 4 must be a 


bounded measure, then ||U(t)|| < |||], where ||/|| = |u](G) is the norm of pw. 
If X are ys are two measures satisfying (2), first of all (n° 4) 


U(#)U (ua = U(e) i: U(y)a.du(y) = / U(#)U(y)a.duly). 


and 
(25.3) U(A)U(p)a = // U(ay)a.dr(x)du(y) . 


Since p(xy) < p(x)p(y), the double integral is bounded above by 


') | p(a)p(y).d]A|(ae)d|yl(y) = i p(t)d| d|(x). ip ply)dlul(y) < +00, 


up the a factor ||a||, and LF justifies the formal calculation. 

However, for any f € L(G), by (1’), the function |f(a)| is dominated 
by ||U(«)||, up to a constant factor. Hence, since ||U(ay)|| is integrable with 
respect to dlA|(x)d|u|(y), so is f(ay). Setting 
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(25.4) v(f) = / | f(xy)dd(a)du(y) 


defines a linear functional on L(G). It is positive if this is also the case of 
» and p. In the general case, \ and yu are linear combinations of positive 
measures bounded above by |A| and |j:|. This leads to the conclusion that (4) 
is a complex measure. 


(ii) Convolution of two measures. Regardless of whether representations 
of G are used or not, these arguments lead to the definition of the measure 


Nx pe: fry ff fley)ane duly y= ff ath f)dd(x) duly) 


provide only that 


(25.5) i, | If(2y)|dA\(a)d| p(y) < +00 


for all f € L(G). For example no restrictions need be imposed if one of the 
measures has compact support because, if A is concentrated on a compact 
set, the function y + [| f(ay)|d|A|() is in L(G). The measure v has compact 
support if this is also the case of A and yz since, denoting by K and H the 
compact subsets of G on which » and yw are concentrated, v(f) 4 0 only if 
the function f(xy) is not identically zero on K x H. This supposes that f is 
not identically zero on the compact set KH of wy where « € K and y € H. 
The measure vy is, therefore, concentrated on KH. In particular, if A and yu 
are the Dirac measures €, and e, at the points a and 6 of G, then 


Eq * €b = Eab, 


and, for any measure 4, the measure 


cos ni fi f flav)duly). 


is the image of yw under the translation g > ag. There is a similar result for 
[u * €q. Writing simply e for the Dirac measure at e reveals its role of unit 
element for the convolution product. 

The definition also makes sense if A and yz, hence Ax ps as well, are bounded 
since, for f € L(G), the function f(ay) is continuous and bounded on G x G. 
The measure v is then bounded and 


(25.6) A * gel < HAT Mell 
since integral (5) is < ||Al|.||vul)-I fll. 


As Ax ps is the image of A x w under (2, y) > xy, theorem 23 of n° 14 can 
be applied. It follows that: 
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(a) a function f defined on G and with values in a topological space 
is measurable with respect to v if and only the function (x,y) + f(xy) is 
measurable with respect to the product measure |A| x |p|, 

(b) a reasonable function f with values in a Banach space is integrable 
with respect to A * yz if and only if it satisfies (5), 

(c) relation (4) continues to hold in this case. In particular, (3) shows that 


(25.7) U(A* p) = U(A)U (pw). 


Exercise 1. Show that 


dx feet wdX(e) = f d*eedu(o) 


(apply the definition of n° 12) and compare the previous results to those 
obtained by directly applying the generalized LF theorem [n° 13, theorem 22, 
in cases where condition (a) holds]. 

Every measure is the continuous sum of punctual measures 


= i €, -dpu(x) 


and the convolution product, which from this point of view is just 


d= ff cox eydr(oiduly), 


consists in extending by linearity group multiplication to these sums. This 
is the argument which, a century ago, led Schur to define “the algebra” 
of a finite group: it is a vector space over C (or any other commutative 
field) having as basis the elements of the group, i.e. the Dirac measures, and 
equipped with a an associative and linear multiplication whose restriction to 
the basis is the composition law of G. In the case of a general Icg, the analogue 
of Schur’s construction is any one of the following: the algebra M14(G) of 
bounded measures on G or the algebra L1(G) equipped with the convolution 
defined below. The second choice is better than the first one which is far too 
extensive. 
The convolution product is bilinear (obvious) and associative: 


(25.8) (A * pL) kU = Ax (WV) 
provided 
(25.9) | | LF (eyz)| alAl()d|ul(y)dlv|(z) < +00 


for all f € L(G). Indeed then 


| f feeyane duly = [Huadu 
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by LF, so that 


| {[ fevane )du(y)dv(z = [f suzyars wl u)dv(z) 


is the integral of f with respect to the left hand side of (8). But it is also 


fae ) [f Hleverdnty) )dv(z )= [aXe ) f Favydus v(o), 


whence (8). The order of integration being of no importance, under assump- 
tion (9), relation 


dpa = f Axeeevdu(c) 


and other similar ones follow. 


The case of a unitary representation (H,U) leads to the definition of an 
operation js +> {4 on measures which plays an important role. It is obtained 
by calculating the adjoint of U(): 


(U(w)"ajb) = Twpbla) = f Wapblasan(e) = f (u (e) alb) ante) 
. / (U(@)alb) di(c), 
where 


(25.10) dji(ax) = d(x") 


is set to be the image under x +> x~! of the conjugate 7. The notation p* 
would be better since 


(25.11) U(y)* =U(ji). 
An immediate calculation shows that 

(25.12) (A* pe) = fxr 
if the convolution of and p exists, and if 


€e =€,-1 forall reEG. 
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(iii) Convolution of a measure and a function. We suppose du(x) = 
y(a)dx where ¢ is locally integrable. Then, formally, 


f) = [f fevareonay= ff Hav)setw)arta)dy 
= ff tee y.anayay. 


Thus, permuting the names of the variables x and y, 


= [fo eatevds 


(25.13) A* (x) = fe (y~*a) dX(y) 


where the function 


is, by definition, the convolution of the function y and the measure X. For 
example, 


(25.14) €y* P(x) =Y (y~'a) ‘ 
whence 
A* Y(x) = ic « p(a).dA(y) . 
No problems arise when y or \ has compact support, or else when y¢ is 
integrable and \ bounded, in which case y* A is integrable (since the measure 


X * is bounded) and satisfies 


A « glla < |All lela - 


If we now suppose that dA(x) = y(x)dx, then for any measure ju, formally, 


f= ff sevelwazauiy) = ff #00) A(y)“ldedy(y). 


So dv(x) = y * u(x)dx, where 


(25.13’) y * u(x) = fe (xy*) A(y™*) duly). 
In particular, 


(25.14) p+ ey(2) = 9 (ay) Aly). 


These factors A disappear if G is unimodular. 
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Theorem 36. Jf \ is a bounded measure, then \ * y is defined for all 
yp € L(G) and all p € [1, +00], and is again in L? and 


(25.15) IA * gllp < IAM-elle - 


A and y may be supposed to be positive by replacing them with their 
absolute values. We first need to show that, for all f € L(G), the func- 
tion f(xy) is integrable with respect to dA(x)y(y)dy, i.e. (theorem 25) that 
f(xy) y(y) is integrable with respect to dX(x)dy. First, it is measurable with 
respect to dA(x)dy: the function (x, y) 4 v(x) is measurable since y is mea- 
surable with respect to dx [n° 11, property (C7)], and so is (x,y) H f(xy) 
since it is continuous. By LF [Theorem 19, (iv)], it therefore suffices to show 
that 


(25.16) xc ) | renew) y)dy < +00, 


where a priori these are upper integrals. The integral with respect to y poses 
no problem since ¢ is locally integrable. As L(G) C L4(G), where 1/p+1/q = 
1, given left invariance of the measure needed to define || f\||q, 


[feu (xy) o(y)dy < llFllallvllp- 


So the left hand side of (16) is < ||Al]-||fllqll¢llp < +00, which proves the first 
result (corollary of theorem 30, n° 18). It remains to show that the measure 
A * y is in fact defined by a function satisfying (15). 

We start by observing that the measure d\(x)dy is invariant under the 
homeomorphism (x,y) > (a,x~'y) from G x G onto G x G since for all 
FeL(GxG), 


[[ Pena = | are) [ Penav= fare) [ P(e,2y) ay 


As this homeomorphism transforms the function f(ay)y(y) appearing in (16) 
into f(y)p(a—ty), if v = A* y, then 


A) = ff seme BING yay = [f rere y) dX(2)dy 
" eiddeabe 


LF then shows that, with respect to the Haar measure, the function x 
y(a~ty) is A-integrable®® for almost all y where f is not zero, hence almost 


°6 This result always seems incredible when first encountered: what does measura- 
bility with respect to dx have to do with measurability with respect to an almost 
arbitrary measure \? But we do not claim that y(a~*y) is A-measurable with 
respect to all y, we only claim that this is the case almost always. Exercise: 
suppose G = R and X(f) = Df (En)/n?, where n +> &y is a bijection from N 
onto Q. Find an interpretation for this result in this case. 
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everywhere if G is supposed to be countable at infinite (or almost everywhere 
on every compact set otherwise). In addition, this calculation shows that the 
function 


A* oy) = i; p (ey) ad(a), 


defined almost everywhere, is locally integrable and that as was seen above, 


lv(f)| < f+ oW)l-F@ldy< IAN MF llallellp 


for all f € L(G). Corollary of theorem 30, n° 18 finally shows that A * yp € 
L(G) and satisfies (15), qed. 

If U,(a#) denotes the representation of G by left translations on L? [n° 15, 
(ii)], the function y + y(x~1y) is just Up(x)y. However, the general definition 
4 


U,(A)a = | U,(x)a.d\(x) 


applies to the representation considered on L?(G). It is, therefore, “obvious” 
that, in this case, 


(25.17) UpA)yp =A*Y 


for all y € L?. This is correct, but the left hand side is the integral of the 
function « ++ U,(x)p with values in L?, defined by the general method of 
n° 4, (i), whereas the right hand side is defined in a completely different way. 
Hence a proof is needed. 

For the convenience of calculations, generally speaking, we set 


(yl) = [ ewe@ae ifpe Ll’, pelt. 


To show that y = 0, it suffices to show that (y|f) = 0 for all f € L(G). So 
the proof reduces to checking that 


(Up(A)p— A*glf) =0 for fe L(G). 


As p+ (y|f) is a continuous linear functional on L?, [n° 4, (i)] 


( fe »(@)p.dX(z)|f) = fu o(e)olf) dd(e). 


Hence by definition of U,(z), at least formally, 
(f Upteielsante) = farce) [ (ey) Fay 
= [Fea f(y) dX(x) 
= | d* ol) Foldy = *slf). 


§ 7. The Commutative Fourier Transform 181 


This calculation is justified by LF since we already know that under the 
assumptions made, the function f(y)y(a~+y) is integrable with respect to 
dX(a)dy. Hence (17). 


As U(A x uw) = U(A)U() in any representation, (17) shows that 
(A * 1) * p = A* (wy) 


for all bounded measures \ and y and the function y € L?(G). This also 
follows from (8). 

Finally, the map ys +> ji defined in (10) can be restated when du(a) = 
p(x)dx. Clearly, 


where 
(25.18) P(x) = g(a!) /A(x), = (a1) if G is unimodular. 


The map ¢ +> ¢ is isometric on L1(G), and even on all L?(G) if G is uni- 
modular. Relation (12) shows that 


(25.19) (Ax y) =G*X. 


(iv) Convolution of two functions. If d\(a) = p(a)dx and du(x) = w(a)dx 
are absolutely continuous, whence d|\|(«)d|u|(y) = |p(x)w(y)|dady, then the 
existence of the convolution product means (theorem 25) that f(xy) yp(x)w(y) 
is integrable over G x G for all f € L(G). Besides, it would be enough to 
check it when f, instead of being continuous, is the characteristic function of 
a compact set K C G, which would be reflected by the condition 


ie \o(x)v(y)| dady < +00. 
Then 
vif = ff tenetowordedy = ff radelenw(o-?tedy. 


Hence, if the measure v is well-defined, the function y +> y(axy)w(y~") is (LF) 
integrable for almost all 2 and its integral is a locally integrable function of 
x since its product with any f € L(G) is integrable. This gives the definition 
of the convolution 


(25.20) p* (x) = f eee av= [ eyo a)ay 


of two locally integrable functions and the formula 
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(25.21) u(t) = f Flo)se* We)de- 


Instead of y, it is possible to take y~! in (20) provided dy = dyy is replaced 
by dy~! = A(y)~'dy, a detail that should not be forgotten when G is not 
unimodular. 

There still remains to find assumptions under which integral (20) will be 
well-defined. This is obviously the case if one of the functions y, w is in L(G) 
and the other one is integrable, the result being a continuous function with 
compact support if this is also the case of y and w. Theorem 36 leads to a 
less trivial result: 


Theorem 37. Let G be a locally compact group. For all g € L(G) and 
w € L(G), 1 < p < +00, the function y & y(xy)W(y—!) is integrable for 
almost all x € G, the function 


[eeu ay = y * (2) 
is in L?(G) and 
(25.22) Ile * Yll> < Mella lp - 


If G is unimodular, the product vy * w exists when y € L?, w € LI with 
1/p+1/q = 1. To see this it suffices to verify that f(xy)y(x)w(y) is integrable 
over G x G for all f € L(G). The change of variable (x,y) 1 (z,x27~1y) 
reduces the verification to the function f(y)y(x)w(a~+y), and the change of 
variable (x,y) > (yz, y) to f(y)e(yx)w(a—*). The function x + ¢(yz) is also 
in LP for all y, and «++ w(a~") is again in L4(G) since G is unimodular. 
Moreover, the norm of these functions is the same as that of y and w. Hence 
J \e(yx)p(x~")|dx < ||yl|p||v||q, and so the double integral is convergent, 
with here too relation (19). 

For p = 1, (22) enables us to consider L(G), equipped with the convolu- 
tion product and the norm of L1, as a complete normed algebra — except that 
it does not necessarily have a unit element. Such an element would indeed be 
a function € € L+ for which ¢€ * f = f for all f, ie. for which 


| flevr (y~") dy = f(x) almost everywhere. 


But if f € L(G), both sides are continuous functions of x, so that the previous 
relation must hold for all x. So the measure e(y~+)dy is the initial Dirac 
measure. It can be defined by an integrable function only if G is discrete. 
Replacing L! by the product 


A'(G) =C x L(G) 


equipped with the obvious multiplication and the norm ||(a, f)|| = |a|+|| fla 
reduces the general case to a normed algebras with unit element ¢. This boils 
down to introducing measures of the form ade(x) + f(x)dx with a € C. 
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When ¢ and ~w are both integrable, relation (7) shows that 
(25.23) U(p*) =U(y)U(y) 


for any bounded representation (H,U) of G, i.e. such that sup ||U(a)|| < +oo. 
If (H,U) is unitary, (11) shows that 


(25.23’) U(y)" = U9). 


= [ Fue Hate*ew)ay = f oluvxyFFay 
since d(yx) = A(a)dy. For x = e, this gives the very useful relation 


(25.24) (pid) = e(* 9) 
which enables us to compute the inner product of L?(G), at least if y,w € 
L(G). Similarly 


y« p(x) = f een iy Yay = [ eeuoWAtay 
and so 


(25.24’) (y|) =e (¢ * 3) if G is unimodular. 


Therefore, the identity 
(25.25) e(f*9) =eg*f) for fg € L(G) 


characterizes unimodular groups. Measures satisfying it are said to be central 
and will play an important role in n° 31. 


(v) Dirac sequences . We saw how useful they were in Chap. V, n° 27 and 
again in Chap. VII in relation to Fourier sequences and integrals. In fact they 
can be defined in any locally compact space and, in the case of a group, they 
can be constructed using convolution products. Let us start with a general 
result: 
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Lemma 2. Let X be a locally compact space, a a point of X and (fin) a 
sequence?” of bounded complex measures on X. Assume the following condi- 
tions hold: 

(D 1) sup ||Hnl| < +00; 

(D2) lim [dpy,(e) = 15 

(D 3) for any compact neighbourhood V of a, 


(25.26) lim d|n|(x) = 0. 
xX-V 

Then 

(25.27) tim f f(2)dun(x) = Fl) 


for all Borel functions f, bounded and continuous at a. If the measures [in are 
carried by a fixed compact set K, (27) holds for any Borel function continuous 
at a and bounded on K. 


Dividing pu, by fn(1) which, by (D 2), tends to 1 reduces the proof to 
the case where p,(1) = 1 for all n. The function f being Borel and bounded, 
and hence integrable with respect to any bounded measure, 


Un(f) — f(a) = en Lf — f(@)] = un(9), 


where g satisfies the same assumptions as f, with moreover g(0) = 0. Then, 
for any compact neighbourhood V of a, 


pala) = f ate rannay+ f ole)din(a). 


The absolute value of the right hand side is bounded above by integral (26), 
up to the factor ||g||.., and so tends to 0 for all V. As g is continuous and 
zero at a, |g(x)| <r on V if V is sufficiently small. For such a neighbourhood 
V and sufficiently large n, (D 1) shows that the left hand side is < Mr, 
where M is a constant, whence (27). If there is a compact set K C X such 
that |u,|(X — K) =0 for all n, it is then unnecessary to investigate what is 
happening in X — K, and replacing ||g||.. by the uniform norm ||g||x on K, 
the argument remains the same, qed. 

It should again be noted that the proof equally applies to functions with 
values in a Banach space. This can prove useful as we will see. 

If (H,U) is a representation of some IcgG, this result can indeed be 
applied to the function « + U(a)a for all a € H, since such a function 


°? This countability assumption could be avoided by considering a family of mea- 
sures ty dependent on a sufficiently small neighbourhood V of e and by changing 
conditions (D 1) to (D 3) in an obvious manner. 
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is continuous and bounded on every compact set. As a result, if (un) is a 
Dirac sequence ata =e, then 


(25.28) limU(pin)a=a 


for allac H. 
Let us for example choose the representation given by left translations on 
L(G) and apply (17). We find that 


(25.29) lim?un*p=y forall ye L(G). 


If dun(x) = fr(a)dx, where the functions f,, are in L,(X), with integrals 
equal to 1, and for large n, vanish outside an arbitrary compact neighbour- 
hood of the unit element, then the conditions of lemma 2 hold. Thus 


(25.29’) Lim?f,k*g=. 


But in this case, the functions f, *«y are continuous. Indeed, for all f € L(X), 


feg(e)= / fley)oly7)dy. 


As x tends to a limit ao, the function y 4 f(ay) converges uniformly to 

y > f(xyo) while remaining zero outside a fixed compact set over which y 

is integrable. This enables to pass to the limit in the integral and so gives a 

systematic way of approximating any f € L? by continuous functions. 
These results admit several variants that one learns as and when required. 
Till n° 28, we will content ourselves with commutative Icgs. 


26 — Fourier Transform®®’ on L1(G) 


(i) Characters of a commutative lcg. Though generally speaking the algebra 
L*(G) does not have any unit element, we will say that a character of L'(G) 
is a homomorphism y from L'(G) to C satisfying y(€) = 1 if G is discrete; y 
may, on the contrary, be considered to be the trivial homomorphism when G 
is not discrete. I will denote by X(G) the set of these characters and by oo 
the trivial character. Every character y of L' then arises from the character 
OO, f)' 6 A+x(f) of the algebra A'(G) = C x L(G), and so is continuous. 
At the same time, X(G) is just the spectrum of the normed algebra A!(G). 


°8 All following results are already in André Weil’s book, but he used what was then 
known (L. Pontrjagin) on the construction of general commutative lcgs based on 
“classical” groups. Henri Cartan and the author’s article, Analyse harmonique et 
théorie de la dualité dans les groupes abéliens localement compacts (Ann. scient. 
de l’Ecole normale supérieure, 1947) gives a direct presentation of the theory 
starting from the Gelfand-Raikov theorem on the existence of irreducible unitary 
representations for every lcg and using functions of positive type which we will 
discuss in the next 8. 
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Example 1. If G = R, the Fourier transform immediately provides us with 
such homomorphisms, namely 


paqws / H(aelayae, 


where recall that e(a) = exp(27ia). If R is replaced by a Cartesian space EF 
with dual E*, then likewise, setting 


(26.1) e(x, y) = exp (277 (x, y)) , 


where (x,y) = y(x) is the value of the linear functional y € E* at x © E, 
every y € E* defines a character 


$2 7G)= / f(aele yde 


of L'(E), where dx is “the” Lebesgue measure on E used to define the 
convolution product 


frg(e) = i; fle — yg(u)dy. 


Lemma 1. Let y be a homomorphism from L(G) to C which is not iden- 
tically zero. Then there is a unique continuous function x(x) on G such that 


(26.2) x(xy) = x(@)x(y) 


and for which 
(26.3) xt) = f fax) 


for all f € L1(G). Then |x(x)| = 1 and conversely, every bounded and mea- 
surable solution of (2) defines a character of L'(G). 


The existence of a bounded and measurable function y(x) satisfying (3) 
follows from the duality of LZ? (n° 18, theorem 30). By definition of f * g, 
relation x(f * g) = x(f)x(g) can then be written 


[[ evteratudedy = [ [OWA gtv)aedy. 


This is in particular the case for f,g € L(G). If y is shown to be almost 
everywhere equal to a continuous function, it will obviously satisfy the first 
relation (1), |x(z)| = 1 following from the fact that, for any x, the set of 
numbers y(a)” = x(#") must be bounded. 
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But as the previous equality holds for all f,g € L(G), 


i a(y)xtey)dy = i a(u)xleyxtudy = xC@)x(9) 


necessarily holds for almost all «. The left hand side is the convolution of a 
locally integrable function and of a continuous function with compact support 
y + g(y—*). So it is a continuous function of x. Thus the result follows by 
choosing g in such a way that y(g) 4 0. The converse is obvious, qed. 

A bounded, continuous solution of (2) which is identically zero is, by 
definition, a character of the group G; G will denote the set of characters of 
G. Hence 


_ G if G is discrete 
GU {co} otherwise . 


(26.4) X(G) 
Following André Weil, the elements of G are often written by letters such as 
Z, Y, g, etc, the value of the character at x € G being written (x, ) = (x). 
To avoid confusion with the notation of linear algebra — also found in Fourier 
transforms: example 1 above -, I will instead use the notation 


(26.5) e(x, x) = x(x) 


which recalls that of Chap. VII. 

Example 2. For G = R, we recover the exponentials e(ay) of Fourier trans- 
forms. There are no other solutions given the characterization of exponential 
functions by their functional equation (Chap. IV). In the more general case 
of a Cartesian space E, we clearly only get the functions e(x,y) of exam- 
ple 1. If G = T, the multiplicative group of complex numbers with absolute 
value 1, we obtain the exponentials e,,(u) = u” of the theory of Fourier series 
(Chap. VII, § 1), etc. 

As the product of two characters is again a character, and so is the inverse 
of a character, multiplication defines a commutative group structure on es 
Hence 


(26.6) e(zy, x) = e(x,x)e(y,x), e(@, xx’) = e(x, x)e(z, x’), 
(26.6’) e(x, x)" =e (2, x") =e(z,y), Je(z,x)|/=1. 


Example 3. For G = R, the characters are parameterized by real numbers 
and multiplication in G becomes addition in R. For G = T, they are param- 
eterized by n € Z and G is the additive group Z. For G = Z, the characters 
are functions n + u”, where u € T, and, as a group, G is isomorphic to T. 


oN 


By the way, note that in these three classical cases, G = G provided G is 
equipped with the usual topology. 
Exercise 1. Determine G when G is the multiplicative group C*. 
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In the light of these examples, in the general case, we associate a Fourier 
transform 


(26.7) Fo) = | tee, xae 
to all functions f € L'(G). By definition of characters, 


(26.7’) (x) = x(f) 


and so, trivially, fg = f g and fF =f. Also note the very useful formula 
(26.8) Fe x(o) = f Bly)x (ey) dy = Foote) 


valid for all f € L'(G). 


(ii) The topology on the dual group. In the standard theory, the Fourier 
transform of an integrable function is continuous and tends to 0 at infinity 
(Chap. VII, n° 27, theorem 23). To generalize this theorem of Riemann- 
Lebesgue proved in Chap. VII by methods special to R, we are going to 
transform it into the definition of the topology of G. We will thus avoid 
proving it, even if it means checking whether this topology is indeed the 
usual one in standard cases. 

Thus first of all it means equipping G with a topology that will make the 
Fourier transforms f(x) = x(f) continuous. The simplest way of proceeding 
is to do the bare minimum to ensure this result. However, G only differs 
from the spectrum X(G) of A1(G) by the possible addition of the character 
(A, f) + A. So the solution is to equip G with the weak topology of the dual 
of A1(G), ie. of the dual L® of L!: a neighbourhood of a character yo € G 
must contain the intersection of a finite number of subsets of G each defined 
by a relation of the form 


n~ 


Ix(f)-xolfl<r, ie. |FO)-flxo)| <r. 


Now, the set X(G) of characters of A‘(G) is compact with respect to the 
weak topology (n° 20, end). Since by lemma 1 or relation (4), G can be 
identified with X(G) if G is discrete and with X(G) — {co} otherwise, G is a 
locally compact space in all cases (and compact if G is discrete, in which case 
it is not necessary to check that the Fourier transforms tend to 0 at infinity!). 
If G is not discrete, the compact subsets of G are the closed subsets of X (G) 
which do not contain 0. Now, any relation of the form lf(x)| >r with r >0 
defines a closed subset of X(G) which does not contain 0. Thus relation 


FOO! > r defines a compact subset of G, and so the function f tends to 
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0 at infinity? on G as expected. This result, trivial except for vocabulary, 
explains the notation co. 


Lemma 2. The weak topology is identical to the topology of compact con- 
vergence on G, t 


(This shows that, for G = R, this is the usual topology of R). First, 
compact convergence implies weak convergence in the unit ball of L°°. Indeed, 
for f € L' and y, yp in the unity ball of L©, 


i: f(a)p(a)de — [ f()po(w)de 


< [ LF(2)| ow) — go(a)| ax 
= f \s@-Io(@) - eo(a)lae + f 


G-K 
for every compact set K C G. For all r > 0, K can be chosen so that the 
integral of |f| over G—K is < r. Then the latter integral is < 2r for all y and 
yo. If moreover |y(x) — yo(x)| < r on K, in other words if ¢ is sufficiently 
near Yo with respect to the topology of compact convergence, the former one 
is < r||f||1. Hence the result. 

Conversely, to prove that, weak convergence in G implies compact con- 
vergence, we use a general result: 


Lemma 38. Let E be a Banach space, E" its topological dual, B’ the unit 
ball? of E' and K a compact subset of E. Weak convergence in B' is equiv- 
alent to uniform convergence in K. 


We need to show that, for all fo € B’ and all r > 0, there exists a 
neighbourhood V of fo in B’ equipped with the weak topology such that 


fev = |fx)—fo(x)|<r foralxe kK. 


Kk being compact, for all r > 0 there are finitely many x; € K such that 
the balls B(x;,r) C E cover K. By definition, the set of f € B’ such that 
| f (xi) — fo(x:)| <r for all i is a neighbourhood V of fo with respect to the 
weak topology. For all x € K, there exists i such that ||x — x;|| < r. For 
f EV, allof 


If i) — folxil<r, If (x)- fe) <r, |folx) — fo (x) <r 


°° Recall that a function f defined on a locally compact space X tends to 0 at 
infinity if the set |f(a)| > r is contained in a compact set for all r > 0. Taking 
the Alexandrov compactification X U{oo} of X and setting f(co) = 0, this means 
that f is continuous at oo since its open neighbourhoods are by definition the 
complements of the compact subsets of X. 

100 Tf F is a Banach space, any continuous linear functional f on E has a norm 


f\| =sup|f(#)| /\lall, 


so that the set E’ of these linear functionals is also a normed space, which is 
obviously complete. 
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hold. The last two inequalities follow from the essential assumption that 
I| fll < 1. Thus 


f eV = |f(x) — fo(x)| < 3r 


for all x € K, qed. 
Coming back to the proof of lemma 2, as a general rule let us set that 
fe(y) = f(xy). Then relation (8) becomes 


(26.9) Fodx(a) =f exe y= fr floy)x@)dy = xe) 


As fz is the image of f under the translation operator U(a~') on L', the 
map x +> f, from G to the Banach space L1 is continuous [n° 15, (ii)]. 
For any compact set K C G, the set Ky of fz, where x € K, is, therefore, 
a compact subset of L’. Hence, if a varying x € @ converges weakly to a 
limit yo € G, lemma 2 then implies the compact convergence of functions 
fO)X(@ ) to fore ). As fix ) tends to F(x) and as f can be chosen so 
that F (x0) # 0, we conclude that y(xz) converges uniformly to yo(x) on K, 
proving lemma 2. 

This result shows that the topology of G is compatible with the group 
structure of G. ee 

The set of functions f is an algebra of continuous functions on X(G) 
since feg= = 3 g. Replacing f by Z replaces f by the conjugate function. The 
functions f separate the points of G. It would even be enough to only consider 
f € L(G). Functions of the form y ++ f(x) + A, where \ is an arbitrary 
constant — i.e. the Gelfand transforms of the elements of A'(G) —, separate 
the points of X(G). Hence they enable us to verify that the assumptions 
of the Stone-Weierstrass theorem hold. As a result, Functions of the form 
f(x) + are everywhere dense in the space of continuous functions on X(G). 
But to approximate a function vanishing at 0 = oo, i.e. a function of Leet), 
there is no need to add constants. Thus: 


Lemma 4. Fourier transforms of f € L'(G) are everywhere dense in the 
space Lo.(G) of continuous functions tending to 0 at infinity on G. 


We have already seen that if G is discrete, then Gis compact. Let us show 
that if G is compact, G is discrete. To this end, we use the orthogonality 
relations 


(26.10) [or wae SA) at eS ys 0 ifx 4x’ 


for the characters of a compact group, already encountered in Chap. VII 
when G = T. This assumes that the invariant measure of G is normalized, 
thus making (10) obvious for y = y’. For y # x’, we compute 


x' * x(x) = J (cy~*) x(y)dy = x'(x) (xIx’) - 
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As xX * x = x¥* x’, xX/(2)(xlx’) = x(%)(xlx’), and as two characters are 
proportional only if they are the same, the expected result follows. This being 
settled, it remains to check that the left hand side of (10) is a continuous 
function of (xy, x’) on G x G. This is clearly since the topology of G is that 
of uniform convergence in G. 

To summarize: 


Theorem 38. Let G be a locally compact commutative group. The weak 
topology of the dual of L' coincides with the topology of compact convergence 
on the multiplicative group @ of characters of G. The Fourier transform of 
every f € L'(G) is continuous and tends to 0 at infinity on G. Any continuous 


function tending to 0 at infinity on G is the uniform limit of functions i 
where f € L'(G). 


(iii) The canonical homomorphism G —> GC: Since, for given x, the func- 
tion x H x(x) = e(x,x) is continuous, by (6), it is a character of G, and 


hence an element of G. This gives a homomorphism of groups 
(26.11) ESE 

for which, by definition, 

(26.12) e(x, x) = elx, j(2)] - 


In this relation, the left hand side refers to the duality between G and CG; 
the right hand one to the duality between G and G. Let us show that j is 
continuous, in other words that, if some varying x € G converges to some 
a € G, then e(x, x) = x(x) converges uniformly to x(a) on every compact 
set kK CG. 
Indeed, by (9), 
|Fodx(@) — FOdx(a)| = IxUe — fall $ Ife — Falla 

for all x € G and f ¢ Li. Asa fp is continuous, the right hand side 
is arbitrarily small for x sufficiently near a. So, for all r > 0, there is a 
neighbourhood V of a such that x € V implies 


n~ n~ 


Fodx(@) — FOOx(@)| <r for all x € G. 
To deduce that, for all r > 0, 
ly(z) —x(a)|<r forall ye kK 


if x is sufficiently near a, it suffices to show that there exists f € L1 such 
that 
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(26.13) ink |Foo| >0. 


This is computation rule (R 3) of Chap. III, n° 7 concerning uniform con- 
vergence. But using the function f(#) defined at the end of n° 24, (iii), a 
trivial calculation shows that the Fourier transforms of f and f are mutually 


conjugate. As a result, 


(26.14) f= heh ford 


n~ 


This said, for all y 4 0, there exists f such that f(y) 4 0. One may assume 
that |f(x)| > 1 in the neighbourhood of x, and so cover the compact set A 
with finitely many open sets {| fi(x)| > 1}. The function f = > fi * f; is then 


>1on K, qed. 


27 — Fourier Transform on L?(G) 


(i) The algebra A(G) and its characters. We consider the regular represen- 
tation (H,U) of G,where H = L?(G) and where U(a) transforms f € H 
into 


(27.1) U(x) f :y— f (a*y) . 


As was seen in (25.17), for f € L1, the integrals 


(27.2) U(N) = f Ve)f@de 

are the convolution operators 

Cis) U(f\g=f*g 

on H. They satisfy 

(27.4): U(f¥g) =U), UF +9) =U +U) 
and also 

(27.5) Upreug). 


These formal properties show that the set A(G) of limits in norm of operators 
of the form U(f) +1 is a commutative GN algebra, not to be confused with 
the algebra A!(G) used above. The Fourier transform on L? will be obtained 
by setting out clearly the results of n° 22 in this case; this is in particular 
the case of Plancherel’s formula 


[ltr ar= | \FooPax, 
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which obviously assumes that the invariant measure dy of G has been con- 
veniently chosen. 

Let us begin with the characters x of A(G). Knowing them on an every- 
where dense subspace is sufficient, for example on the subspace of operators 
U(f) + Al, and as y(A1) = 4, it is sufficient to know y on all U(f). Since 
|U(f)|| < \Flla, every character y of A(G) defines a character of L1(G), 
which will also be denoted by y, such that 


(27.6) xX(f) = fx) = xlUA)] - 


This leads to an injective map from A(G) to X(G) which, as will be seen 
later, is a homeomorphism. 
For the moment, let us draw an important consequence of (6): 


Theorem 39. The Fourier transform fH i f € L(G), is injective. For 
any pair of distinct elements x,y of G, there is a character x of G such that 


x(x) # x(y). 


If f(x) = 0 for all y € G, then, by (6), y[U(f)] = 0 must perforce hold 
for any character x of A(G). But in a GN algebra, the Gelfand transform is 
injective (theorem 33). Hence U(f) = 0, ie. f *g = 0 for all g € L?. Thus 
replacing g by a Dirac sequence and using (25.29) gives f = 0. 

The second proposition, obvious for the groups mentioned in the examples 
of the previous n°, reduces to showing that for x 4 e, there exists . such 
that x(x) 4 1. But (26.9) can also be written 


(27.7) X(FA)x(x) = x (fe) ; 


where f(y) = f(x~ty). If x(x) = 1 for all y, then y(f — fr) = 0 for all 
XE G and f € L(G), whence it would follow from the first proposition of 
the theorem that f — f, = 0 for all f € L(G), an absurdity if « # e. 

The map A(G) — X(G) defined by (6) is clearly continuous with respect 
to the weak topologies of the duals of A(G) and L?. As it is injective, it is, 
therefore, a homeomorphism from A(G) onto a compact subset of X(G). In 
fact: 


Lemma 1. The map A(G) to X(G) is surjective. 


The proof consists of two parts, the first one comprising computations 
without any immediately clear connections with the statement. 
(a) We choose a character y € G. For all f € L', the function 


x Foot) = f sexo’ eae 
is the Fourier transform of the function 


(27.8) VO) fF a> flx)x(@) = f(a)e(a, x). 
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Thus, considering y’ a linear functional on L?, 


(27.9) xX Vodfl = fcr’). 


Operator (8) of multiplication by a function with absolute value 1 can be 
defined on every L? and in particular on L?, where it is unitary and satisfies 


Vix =V@)=VON™: 


In fact the map e y +> V(x) is a unitary representation of G on L?(G). 
The product of V(y) and some U(f) can be easily found: 


VU (Nate) = x(o).F«9Ce) = x(a) fF (ey) 
= fe (xy~*) f(ay")x(y)g(y)dy. 
This function is obtained by applying the operator V(x) to g, then the con- 


volution operator by x(2) f(x), i-e. by function (8), on the function obtained. 
As a result, V(y)U(f) = U[V (x) f]V (x), ie. 


(27.10) U(V(x)f] =VO)U(f)VO)71 for all ye G. 
It follows that 
(27.11) T € A(G) = VQ)TV(x)" € A(G). 


This is shown by (9) for T = U(f), hence for T = Al + U(f), and in the 
general case can be deduced by passing to the limit in norm. 

(b) We are now ready to prove the lemma. Let y’ be a non-trivial character 
of A(G) on Lt. Map (11) being a ring homomorphism, by (10), the map 


x": TH+ x! [V(X)TV (x) ] 


is also a non-trivial character of A(G) on L'. Identifying y’ and y” with the 
corresponding characters of L', if T = U(f), then (10) shows that, for all 
x € G, V(x)TV(x)~! corresponds to the function V(x) f. So by (9), 


MA =x Vod=F Ox). 
Hence if a character y’ of G stems from a character of A(G), so does y~! x’ 
for all x € G. As aresult, any character of G stems from a character of A(G), 
qed. 
Exercise’! 1. Show that the set of operators uV(y)U(x), where wu is a 
scalar with absolute value 1 and where x € G, x € G, is a group with the 
usual multiplication. What the group law is thereby obtained on T x G x G? 


101 Trivial starting point from which in 1964 André Weil drew far less trivial con- 
sequences (Collected Works, volume 3, pp. 143-211). For G = R, the question is 
related to the Heisenberg group, i.e the set of real 3 x 3 upper triangular matri- 
ces with 1’s on the diagonal, whose Plancherel formula I had computed in 1949 
(Journal de Liouville, 1951). Dieudonné transformed part of Weil’s article into 
exercises 1 to 8 of XXII.15. 
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(ii) Spectral decomposition of the regular representation. We are now able 
to apply the general constructions of n° 22 to the algebraA(G). Before doing 
so, we set out clearly the GN theorem in this case. 

For all T € A(G), by lemma 1, the Gelfand transform of the general case 
can be identified with a continuous function T(x) on X(G) such that 


(27.12) To =fo) if T=U(f), 


n~ 


on the understanding that f(0) is set to be equal to 0 if y is trivial on 
U(f) for all f € L'(G). The GN theorem tells us that the map T 1 T is an 
isomorphism for all structures. The functions 7 are, therefore, the continuous 
functions on X(G), i.e. functions of the form y+Cte, where y € Est), the 
Fourier transforms of f € L'(G) being functions belonging to PkG) 

On the other hand, n° 22 enables us to associate to any pair of functions 
p,q © H = L*(G) a measure fy, on X(G) such that 


(27.13) (Sp|T) = sk btn ald 


for all S,T € A(G). For S = U(f) and T =U(g) or 1, $= f and T =Gorl. 
Thus relations 


(27.14) (f*plg*q) = / FOodTOO Ap, g (0) 
(27.14) Gi0= i Flo) attp q(x) 


which hold for all f,g € L*. All these integrals are a priori extended to X(G) 
and not only to G. 


These measures enable us to associate an operator M(y) on L?(G), given 
by 


(27.15) (M(y)plq) = I oe POMralXd 


to “every” bounded (for example Borel) function y on X(G). These opera- 
tors commute with all T € A(G), in particular with all U(f), and commute 
pairwise as in all spectral decompositions. As was seen in n° 22, 


Mag + Bb) =aM(y) + BM(v), 
(27.16) M(ev) = M(y)M(v), M(%)=M(y)*, 


and also, by (14’), 


n~ 


(27.17) M(f)=U(f) for all fe L’. 
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In particular, any reasonable subset w C X(G) defines a projection oper- 
ator M(w) on L?. Computing the operator M associated to the product of 
the characteristic function of w and a bounded function y shows that 


(27.18) (M(w)M(y)plq) = / (x) dptp.9(X) 


Ww 


for all bounded y and p,q € L?(G). Let us for example take w = {oo} 
by assuming that G is not discrete. For y = 3 the right hand side of 
(12) is f(co)up.q(w) = 0. Hence, as in this case M(y)p = U(f)p = f * p, 
(M(w)U(f)plq) = 0. So q € L? being arbitrary, this proves that, for all 
functions f € L' and p € L?, f * p is a zero of M(w). But in L?, every 
function p can be approximated by such convolutions using Dirac sequences. 
So M(w) = 0, which shows that {oo} is null with respect to all up4. Hence 
in (18) it is enough to integrate over G or, equivalently, to regard all ftp.q as 
bounded measures on G. 

One can also choose the function x + x(a) for some x € G. It is contin- 
uous and bounded. Denoting by M(x) the corresponding operator, 


(M (1c) M(y)pla) = / (2) (x) dptp a(x) 


for all bounded and Borel y on G: in particular for y = 1. Let us show that 
M(x) =U(a2)~!. For y = f, a trivial calculation gives 


xe) Fd = ‘| HOMOLE 


This is the Fourier transform of the function U(x)~!f = fr : yo f(xy). So, 
as M(y) = U(f) in this case, 


M(x)U(f) = Uf) = / f(ay)U(y)dy = U(a) Uf), 
whence M(x) = U(x)~!. Therefore, 


(27.19) (U(«)M(¢)plq) = i eC, x(x) dbp g(X) 


for any bounded and Borel function y on G; especially for y = 1. 


(iii) The invariant measure on the dual. To simplify notation, set 
=" (Gata) alta); 


A confusion with the space L? for p = 10 +2 = 7+5 being unlikely. The 
Fourier transforms of f € L'” are everywhere dense in L..(G) since L!? 
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nN 


contains L(G). The set they form is also a subalgebra of L..(G) with respect 
to the usual multiplication, since L!? is a subalgebra of L' with respect to 
the convolution product thanks to relations 


ted? Cay, TP Rie eT: 


The measures /1,q associated to the functions of L'” satisfy an identity that 
will be the starting point of the proof of Plancherel’s theorem: 


Lemma 2. For all f,9,p,q € L'*(G), 


(27.20) F0dGCOdtp,a(X) = PODTOO dtep,g (00) - 


As (20) is an identity between two measures, we need to show that 


(27.20°) / OX) FOTO dHty,g(x) = i (BO TOD ates 90) 


for all y € L(G) or, more generally, Borel and bounded. This means that 


(M(vf 9)pla) = (M (v4) f\g) 


or, by (16) and (17), that 


(M(p)U(f)plIU(g)a) = (M(e)U (vp) FU (a9) - 


This is obvious since, under the assumptions of the lemma, 


U(f)p =U(p)f =f *p 
because G is commutative. 


Lemma 3. For p,q € L'*(G), the measure djty,q does not depend on the 
function p * q. 
Indeed by (17), under the assumptions of the lemma, for f € L(G) 


n n~ 


Mp,a(f) = (M(f)pla) = (U(f)pla) = (f * pla) - 
So by (25.24’), 


n 


Mp.q(f) = (f *p) * Ge) = f *(p*q) (e), 


associativity being justified since f, p and q are integrable. As f determines f 
(theorem 39) and as all f are everywhere dense in L..(G), the bounded 
measure {p,q only depends on p * g, which proves the lemma. 
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Theorem 40. The invariant measure on G can be chosen in such a way 
that 


(27.21) ditp,q(x) = POdax)dx 
for all p,q € L'*(G). 


To simplify calculations, I will denote by A the set of functions of the 


form P(x) = p(x)¢(x) on G, where p,q € L'?(G). Lemma 3 enables us to 
associate the measure 


dup(x) = dpp,q(x) 
to such a function P. Then the following properties hold: 


(Al): PQ€A for all QE A; 
(A 2): for all P,Q e€ A, 


(27.22) / P(x)dyiq(x) = / Q(Kdaurld; 


(A 3): for every compact set K C G, there exists P € A such that 
P(x) £0 for all y € K. 


The first one follows from the fact that L'?« L!* Cc L!*. The second one is 
just lemma 2. To obtain the third one, observe that, the Fourier transforms 
of f € L(G) being everywhere dense on L.(G), there exists p € L(G) such 
that |p(x)| > 1 on K. The function P(x) = |p(x)|? answers the question. 

This being settled, let us show that there is a measure pz on G such that 
dup(x) = P(x)du(x) for all P € A, ice. 


(27.23) up(y) =pu(yP) for ye L(G) and PE A. 


As (22) can be formally written as 


ea P(x) 'dup(x) = Q(x) *dua(x), 


“obviously” the measure P(x)~dup(x) does not depend on P, which gives 
the expected measure jz. But both sides of (*) are well-defined only if the 
functions P and Q do not vanish.!°? Arguments, therefore, need to be less 
brutal. 

Therefore, let us associate the open set 2p = {P(x) £0} to each PE A 
and identify [(Qp) with the subspace of f € L(G) whose support is contained 
in 2p. We define a measure vp on {2p — and not on Ge setting 


102 The reader desiring an easy proof for G = R” will observe that, in this case, the 
function p(x) = exp(—7||z||?), obviously in L’*, does not vanish and is equal to 


nN =~ 


its Fourier transform (here G = G). For P(x) = p(x)pOd = p(x)”, the left hand 
side of (*) is well-defined. Denoting it by du(x), both sides of (14) can then be 


divided by P(x), and so dutp.q(x) = B(x)aO)du(x) for all p,q € L"*, ged. 
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ee i OOP (Xd) dup (x) 


for all y € L(Qp). This is well-defined since the function being integrated 
vanishes outside a compact subset of 2p. 'To show that the result does not 
depend on P, we consider another function Q@ € A and suppose that the 
support K of y is contained in QpNQg. As PQ does not vanish on NPN Qa, 
yp = WvPQ for some w € L(X, kK). Thus by (22), 


Yp(~) = UP(WQ) = Ha(WP) = va(¥). 


As (A 3) implies that the open sets 2p cover G, the existence of jz will 
be a consequence of a general result which enables us to construct a measure 
by “gluing” together measures defined on open sets: 


Lemma 4. Let X be a locally compact space and (Q;); € I an open cover of 
X. Suppose that a measure ju; is given on each 2;. There is a measure ps on 
X such that 


(27.24) uf) = Wilf) 


for all f € L(X) with support contained in Q; if and only if, for any ordered 
pair (i,j), 


(27.25) Wilf) = 4; (f) 
for all f € L(X) with support K C 2; 2;. 


(25) is clearly necessary. If this condition holds and if some f € L(X) 
vanishes outside a compact set K, there exist p; € L(X) whose supports are 
contained in the sets 2;, which are trivial for almost all 7, and such that 
>i p; = 1 0n K. If some gq; € L(X) satisfy the same conditions, then by the 
Lemma’s assumptions pu;(fpiq;) = 1;(fpiq;). Hence 


do MiP) = > Hil fPids) = dos (fP:95) 3 does (Fa5) 


Therefore, the expression p(f) = >> i(fp;) only depends on f. It is the 
expected measure [. . 

We have thus shown that there exists a (unique) measure 4: on G such 
that 


(27.26) (M()pla) = ttyl) = i p(x) BOOTED A(x) 


for all functions y € L(G) and p,q € L!2(G). It is positive since the left hand 

side is > 0 for py > 0 and p = q. The invariance of 4 remains to be shown. . 
First of all, (26) means that the measures djip.q(x) and p(x) G(x)du(x) are 

identical for all p,q € L'?(G). Since the measure djtp,»(x) = |P(x)/Pdue(x) is 
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bounded, this shows (n° 18, corollary of theorem 30) that p € L(G; jt) for 
all p € L'*(G). Formula (26) thus holds for any bounded continuous (and 


even Borel: theorem 25) function y on G, and in particular if y(v) = f(x) 
for some f € L'*(G). In this case, (26) becomes 


(27.27) Go= / FooBOdTOO dH) 


for f,p,q € L'*. As in general 


D(xxo )= [owe (x, xx9")de = [ vleyrole)e@, var, 


making a translation act on the function being integrated in (27) amounts 
to multiplying f, p and q by the same character yo de G. Denoting this 
operation by fH f’, 


fi xp'(x) = fic (zy*) f (cy~*) xo(y)ply)dy = xo(2).f * p(2), 
ie. (f xp)’ = f’ * p’. Therefore, since f +> f’ is obviously unitary, 


(27.28) (f' * p'lq') = (f * pla) - 


This proves that integral (27) does not change if the integrated function is 
made to undergo a translation x ++ xx9 ' or else that any image p! of pu 
under a translation also satisfies (27). So 


i FooPOTOOAH(d) = / FooBOTOO AH (x) 


for all p,q € L!2(G). As the functions f are everywhere dense on L(G), the 
equality 


P(X~)IOD A(X) = DONT) du’ (x) 


between bounded measures follows, whence 4 = p’ because of (A 3). This 
completes the proof of theorem 40. 


Corollary 1 (Plancherel). There is a unique isomorphism f > f from 
L?(G) onto L?(G) which reduced to the Fourier transform on L'*(G). 


The proof reduces to showing that the set of f, f € L", is everywhere 
dense on L?(G). Otherwise, there is a non-zero function F € L?(G) orthog- 
onal to all f, in particular to all P € A used in the proof of theorem 40, 
whence 


[ Foop00TO.ax = 0 
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for all p,q € L!2(G). As F and are in L?(G), the measure F(x)p(x)dx is 
bounded, and the previous relation becomes (theorem 25) 


: FOO)-FOOBO)dx = 0. 


ny 


The functions ¢ being everywhere dense on L5(G), the measure F(x)p(x)dx 
is zero, and as the open sets {p(v) 4 0} cover G, the same holds for F'(y)dx, 
whence F'(x) = 0 ae., qed. 


Corollary 2. The Fourier transform of the product fg of two square inte- 
grable functions is the convolution f *g of their Fourier transforms. 


Let us prove this by replacing g by its conjugate. The Fourier transform 
of the convolution is the function 


x f foglweex)de. 
That of the function g()e(x; x) is clearly the function 
baa Ne 8 aa 


Plancherel’s formula thus shows that the Fourier transform of fg is 


qed. 


(iv) Fourier inversion formula and biduality. 


Theorem 41. Let f be a function belonging either to L1(G) or to L?(G), 
and whose Fourier transform is integrable ; then 


(27.29) f(x) = [een Food almost everywhere 


and the right hand side is a continuous function on G. 


(a) If f € L(G), then the operator U(f) : g + f *g is in the algebra 
A(G) and 


(27.30) (U(F)pla) = / FoopodTOOax 


for all p,q € L?(G). Hence, if p,q € L(G), then 


(U(f\plq) = / | FoopodaT@ela; x)dxdy , 
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the function being integrated being in L!(G x eg) since fand q are integrable 
and p bounded. If this result is applied to the functions p € L,(G) with 
integrals equal to 1 and vanishing outside smaller and smaller neighbourhoods 
of e, then the function p(x) converges uniformly to 1 on every compact set, 
with |p(y)| < 1. Hence passing to the limit in the integral is possible. On the 
left hand side, U(f)p = f * p converges to f in L1(G), and as q is assumed 
to be in L(G), passing to the limit on the left hand side is possible. So 


[fon atayae = ff Fates e(x; y)dedx 


for all g € L(G). The double integral does not pose any problems because of 


the assumptions on f. Thus the inversion formula readily follows. 
(b) Case where f € L?(G). For all g € L(G), (Plancherel) 


f*5(e) = (flU(e 9) = | Fao Aye 


Let us apply this identity to some gn > 0 with integral 1 and converging 
to the Dirac measure at e. The function appearing in the integral tends to 
f(x)e(x, x) while remaining < |f()|. Therefore, the second integral tends 


(dominated convergence) to [ f(xje(z, x). On the left hand side, 
f* In(x) = U(Gn) fF 

and so, by (25.29), 
f =lim? f * Gn(x). 

The result then follows from Riesz-Fischer. 


Theorem 42. The canonical homomorphism j :G— G is bijective. 


The Fourier transform on G is an isomorphism f > f from L? (G) onto 
L?(G). Likewise, the Fourier transform on G is an isomorphism onto L? G ) 
mapping f onto a This gives an isomorphism f +> ? from L?(G) onto Re , 
If fe L12(G), the functions f and . are continuous and, by the previous 


theorem, coincide on G if the Fourier transform on G is defined accordingly. 
The theorem will follow once we have shown that, under the _assumption 


GH G, there exists a function f 4 0 such that f € L!2(G) and f =0 on G. 
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For this purpose, we consider two functions p,q € L(G) and the corre- 
sponding functions y,~ € L?(G). Plancherel’s formula for G (corollary 2 


above) shows that p * g, calculated in CG. is the Fourier transform of yw. 
This function is integrable and, since the Fourier transforms of y and w are 
integrable, y and w are given by the inversion formula applied to G. As a 
result, y and y are continuous and bounded, and so yy € L!? (G). By the 
previous theorem applied to G, the function f € L?(G) such that f= py is, 
therefore, continuous on G and f =pxqg IGF G p and q can easily be 


chosen in such a way that p * q is zero on G, but not on all of G: if A and 
B are the supports of p and gq, it suffices for the intersection of AB and the 


closed set GC G to be empty. This condition is satisfied by choosing A to 
be a disjoint compact subset of G with non-empty interior, and B to be a 
sufficiently small compact neighbourhood of the unit, qed. 

Exercise 1. Generalize the Poisson summation formula. 

Exercise 2. Show that 


Apty q(x) = p(x)q(a)dx 


for all p,q € L7(G). 
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28 — Further Representation Theory 


As was mentioned in n° 15 and 23, a representation of a leg G is a homomor- 
phism «+> U(x) from G to the group of invertible continuous operators of a 
Banach space H such that the map 


xr+U(a)a 


is continuous for all a € H. n° 25, (i) enabled us to associate to every complex 
measure js on G' a continuous operator U(j) given by 


(28.1) U(pja= [eu @adute), 


at least if [ ||U(x)||d|uu|(x) < +00, an empty condition when y has compact 
support, and in particular when du(x) = f(x)dx for some f € L(G). The 
corresponding operator is then denoted by U(f). n° 25 shows that 


(28.2) U(n*v) = U(u)U(v) 


if these operators are well-defined, and especially if 4 and v have compact 
support. In particular, 


(28.2’) U(f *g) =U(f)U(g) 


for all f,g € L(G). The next result, where we only consider functions f € 
L(G), though easy is very useful: 


Lemma 1. Let (H,U) be a representation of G. Every a € H is the limit 
of vectors of the form U(f)a. A closed vector subspace of H is invariant 
under all U(a), if and only if it is so under all U(f). A continuous operator 
commutes with all U(x) if and only if it commutes with all U(f). If (H,U) 
is unitary, the set of U(f) is everywhere dense with respect to the ultrastrong 
topology on the von Neumann algebra they generate. 


The first proposition was proved at the end of n° 25: given measures of the 
form f,(x)dx, with f, > 0 having integral equal to 1 and vanishing outside 
compact on decreasingly small neighbourhoods of e, lim U(f,)a = a for all a. 
This also implies the last proposition: indeed if A denotes the von Neumann 
algebra generated by all U(f), i-e. the set of operators which commute with all 
operators commuting with all U(f), then [n° 19, (vii)] the set of U(f)+A1 is 
known to be everywhere dense in A with respect to the ultrastrong topology; 
to show that so is the set of U(f), it suffices [n° 19, (vii), lemma 9] to check 
that 0 is the the only element of H mapped to 0 by all U(f), which is obvious. 
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On the other hand, the formula U(f)a = fU(x)a.f(x)dx shows that 
U(f)a is a limit of linear combinations of vectors U(a)a. Any closed subspace 
invariant under all U(x) is, therefore, invariant under all U(f). The relation 


U(a)U(f)a =U (x) U(f)a =U (ex * fa 


shows that any subspace, not necessarily closed, invariant under all U(f), is 
invariant under all U(z). 

If a continuous linear operator A on H commutes with all U(x), the 
linearity of the integral of vector functions [n° 4, formula (*****)] shows that 


AU (f) = [Arco tteyae - fe @arteae =U(f)A. 


Conversely, if A commutes with all U(f), for any « € G, we choose f,, such 
that the measure f,(y)dy converges to the Dirac measure at x. Then 


U(a)Aa = limU (f,) Aa = A.limU (f,)a = AU(a)a, 


qed. 
In the case of a unitary representation, formulas (25.10) and (25.11) hold 
for any bounded measure ju: 


(28.3) U(u)* =U (jt) ot f(x) = du (a4), 
(28.3’) U(f)* =U(f) f 


for f € L'(G). The set of operators U(f) is, therefore, a self-adjoint algebra in 
H. Conversely, a unitary representation of G can be obtained from operators 


U(f): 


Lemma 2. Let H be a Hilbert space and f ++ U(f) a homomorphism from 
the algebra L(G) to the algebra L(H). Suppose that 


U(f)* =U(F), UCAS lIfll. for all f € L(G) 


and that the relation U(f)a = 0 for all f € L(G) implies a = 0. Then there 
is a unique unitary representation «++ U(x) of G in H such that 


U(f) = [e@tteae for all f € L(G). 


First, the subspace Ho generated by all vectors U(f)a is everywhere dense 
in H because a vector orthogonal to Hg is a zero of all U(f)*, hence of all 
U(f), and so is zero. 

To define U(x) for x € G, we first do so on Ho. The formula to be used 
is obvious: 
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A small calculation immediately shows that the norms of the two sums are 
equal, which gives bijective and isometric U(x) : Ho —> Ho such that 
U(xy) = U(a)U(y). They can be extended to unitary operators on H. We 
check that the maps 7 + U(x)a are continuous and that the representation 
(H, U) satisfies the required condition by using the simplest properties of the 
convolution product on L(G). 

L(G) could be replaced by L1(G) in this statement. 

Both versions of Schur’s lemma proved at the end of n° 22, (iii) and in 
n° 23, (iii), corollary 2 of theorem 35 can be applied to irreducible unitary 
representations. The next version is also very useful: 


Schur’s Lemma III. Let (H,U) and (H’,U’) be two irreducible unitary 
representations of G and T : H —> H’' a continuous linear map such that 
TU(«) = U'(x)T for alla € G. If T #0, the two given representations are 
equivalent.103 


To prove proposition (b), observe that T*T : # —> H commutes with all 
U(«), and so is a scalar which can be assumed to be equal to 1. The map T is 
then isometric, and so its image is a closed subspace of H’. As it is invariant 
and non-trivial, T is bijective, qed. 

A trivial corollary: Every irreducible unitary representation of a commu- 
tative lcg G has dimension one, i.e. is a group character. Hence Plancherel’s 
formula can be regarded as a decomposition of the regular representation into 
a continuous sum of irreducible representations of dimension one: 


[ wesin = [, (Ue: Foolaco) ax. 
On the left hand side, U(x) is the translation operator which, in L?(G), 
transforms the function f(y) into the function f(2~'y), On the right hand 
side, we equip C with the inner product (u|v) = uv, Finally, for y € ey 
U(a;x) is the multiplication operator by y(x) = e(a#;x) on C. By the way, 
note that in the formula 


Foo) = f te)x@ax = f(x) flea 


F(x) appears as the operator U(f) associated to the conjugate representation 
xX of x. In the case of a non-commutative Icg, to obtain formulas compatible 
with the previous one, one would need to define operators U(f) by U(f) = 
f U(«~")f(x)dx. The “Fourier transform” of f would then be the function 
which associates to each class of irreducible unitary representations of G the 
corresponding operator U(f). But this definition of U(f) would lead to the 
relation U(f * g) = U(g)U(f) which, on a commutative group, assuredly 
reduces to the formula (f * gq) =f g, but which nobody wants in the general 
case. 


103 ie. there is an isomorphism from 1. onto H’ transforming U(x) into U'(2). 
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Exercise 1 (Gelfand and Neumark). (a) Let f be a measurable function on 
R. Show that the function (t, 7) H f(ta) is measurable on R* xR. (b) Suppose 
that, for all a £ 0, there is a null set N(a) such that 


f(ax) = f(x) for all a ¢ N(a). 


Show that f is almost everywhere equal to a constant [using property (C1) 
of n° 10, show that, for almost all x, f(ax) = f(x) for almost all a]. (c) In 
L?(R), consider the unitary operators 


U(a,b) : f(x) - al/?+" fax + b) 


with given ¢ € R, and varying a € R*, b € R. Using a Fourier transform show 
that any continuous operator commuting with these operators is a scalar. 
(d) Use this result to construct irreducible representations of the group 7 > 
az + b (a € R*, b€ R) on L?(R). What about the case where a > 0? 


29 — Fourier Transform on a Compact Group 


(i) Irreducible representations of central groups. Let G be a lcg. Schur’s lemma 
shows that the centre Z of G acts by scalars on every irreducible unitary 
representation (H,U) of G: 


U(z) =a(z)l 
where a is a character of Z. So the coefficients 


Pa,p(x) = (U(x)alb) 


of the representation are solutions of the functional equation 


f(xz) = f(x)a(z). 


It follows that the functions |f(x)| are in fact defined on the quotient group 
G/Z. Hence, if the latter is compact, in which case G is said to be a central 
group, they can be integrated over G/Z. 

Here too, Schur’s lemma is going to prove a basic property of these groups 
using arguments which, in a slightly less simple version, will lead us to the 
Bargmann orthogonality relations in n° 32.104 


104 As regards my own case, a few months after having proved the Bargmann rela- 
tions in 1947, I realized how the same type of arguments led to a direct proof 
of this theorem which all specialists, few in number at the time, obviously knew 
(at least in the case of a compact group). The “result” was new inasmuch as 
in 1927 the inventors of compact group theory, Fritz Peter and Hermann Weyl, 
were only considering finite dimensional irreducible representations. But every- 
one could prove it from their results (see Dieudonné, XXI.4, whose notation 
complicates a quite straightforward question). This proof is now attributed to 
Leopoldo Nachbin, an excellent Brazilian mathematician who took the trouble 
to publish it in 1961. 
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Theorem 43. Every irreducible unitary representation of a central lcg G is 
finite-dimensional. If (H,U) and (H',U') are two non-equivalent irreducible 
unitary representations of G defining the same character of the centre Z of G, 
then 


(29.1) ih (U(g)alb) U"(@)a"b)dg = 0 
G/Z 
for alla,b €H anda’,b’ EH’. If (H,U) = (H',U’), then 
(29.2) ih (U(g)alb) (U"@)a"lb)dg = dim(H)~? (ala’) (b]b) 
G/Z 


where dg is the normalized Haar measure of G/Z. 


For given b € H and b’ € H’, we set 
P(x) = | (U(g)x|b) (x 1b) dg. 
G/Z 


This gives 
(29.3) |Z’ (x, x") < ||b]].|!b"|.[1>I]-|l"11, 
and so [n° 19, corollary of theorem 31] a continuous linear map T from H to 
H’', satisfying ||T|| < ||bl|.||b’|], and such that 
(Tx|x’) = L, (U(g)x|b) (U'(g)x'|b')dg . 


The invariance of the measure shows that (TU(h)x|U’(h)x’) = (Tx|x’) for 
all h € G. So, if the given representations are not equivalent, then T = 0 
(Schur’s lemma III), which proves the first proposition of the theorem. 

In the second case, T is a scalar T(b’, b), whence 


T (b’, b) (x|x’) = L, (U(g)x|b) (U'(g)x'|b’)dg . 


(3) again shows that T(b’,b) = (T'b’|b), where T € £(H), and the invariance 
of the measure again shows that T commutes with all U(g), and so is a scalar 
A. Thus 


d (b'|b) (x|x") = L, (U(g)x|b) (U(g)x'|b‘)dg . 


Then let e;, e2,... be a finite or infinite orthonormal basis of H. The functions 


wij(g) = (U(g)eiles) 


are pairwise orthogonal in L?(G). Using Kronecker’s index, we even have 
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(29.4) (ui |Upq) = dipdjqr, 


where the scalar product is obviously calculated on G/Z. So |luj;||3 = A. 
Since ||x||? = > |(xle;)|? for all x, 


dE lus? = DE IO @elenl? < S0U)eil? . 


ij<n ij<n i<n 


This result equals n since all U(g) are unitary. Integrating the left hand side 
shows that n?\ <n, and son < \~!, which proves that dim(H) < A~!. For n 
maximum, the inequality is replaced by an equality and thus \ = dim(H)~1, 
qed. 

The orthogonality relations can also be written differently by considering 
the integral 


/ THAU (| THBU@)ldg. 


As a function of the ordered pair (A,B), it is a positive Hermitian form 
on £(H), and equals Tr(ASB*) for some unique S € L(H). As above, the 
invariance of the measure shows that S commutes with all U(x), and so is 
a scalar \. For A and B choosing operators whose matrices, with respect to 
an orthonormal basis of H, only contain one non-zero entry, we recover the 
orthogonality relations of the theorem and \ = dim(H)~!; and 


(29.5) | MAU @FIBO Ody = aimr)t TAB"). 


(ii) Central functions on a compact group. The functions f(g) = Tr[U(g)] 
satisfy f(ay) = f(yz), ie. are central functions on G. Over a compact group, 
the formula 


(29.6) f'(g) = i: f(e*gs)ds 


transforms every f € L(G) into a continuous central function equal to f if f 
is central, and non-zero if f is > 0. Proving the formulas 


(29.7) (f*)! =f", (flo) =(f'lg) if g is central, 
(29.7’) (Pag a7 eg 


using the Haar measure will be a simple exercise for the reader. Clearly, 
(29.8) U(ft) = / U(s)U(f)U(s)~L.ds 


on any finite or infinite-dimensional continuous linear representation (H,U) 
of G, for all functions such that f[ ||U(x)||.| f(a) |da < +00. 
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The group G acts on the space G by the map (s,g) ++ sgs~'. Being 
compact, G acts properly [n° 15, (iii), lemma 3]. The quotient space X of 
conjugacy classes of G is, therefore, separated and even compact. Lemma 4 
of n° 15, (iv) on quotient measures shows that if the central functions are 
considered to be defined on the quotient space, the map f +> f® transforms 
L(G) into L(X). Computing the quotient measure when G is a Lie group is 
a famous exercise using the structure of semisimple Lie algebras and enables 
us to compute explicitly the characters of G (H. Weyl!°°). 

Using central functions, we get the Dirac sequences at the origin. Indeed, 
transforming by (6) all f > 0 with integral 1 of a Dirac sequence at the origin 
gives positive functions f% with integral 1. If f vanishes outside a neighbour- 
hood W of e, f# vanishes outside the union of sWs~!. So the proof reduces 
to showing that for any open neighbourhood V of e, there is a neighbourhood 
W such that sWs~' C V for all s. To this end, we associate to each open 
neighbourhood W of e the set W’ of s such that W Cc sVs~'. As V and W 
are open, so is W’. As sVs~! is a neighbourhood of e for all s, every s € G 
belongs to some W’. Since the open sets W’ cover the compact space G, there 
are neighbourhoods W,...,W, such that Ww, cover G. This means that, for 
all s € G, there exists p such that sVs~' > W,. Hence 


sVs 1D WiN...NW,=W, 
qed. This result shows that, for y € L?(G), 
(29.9) f*xy=0 for allcentralf = yp=0. 


The map § can be extended to all L?(G) and is still defined almost every- 
where by relation (6). To begin with, 


Jf seas asda = f Flag 


for all f € L(G). This means that the image of the measure dsdg under 
the map (s,g) +> sgs/ is the invariant measure of G. So theorem 23 of 
n° 14 can be used: for all f € L?(G), the function (s,g) > f(sgs~') is in 
L?(G x G), and so is integrable. As a result, [ | f(sgs~')|ds < +00 for almost 
all g, which makes integral (6) well-defined almost everywhere. Finally 


[fle lea) Pas = flor ag 


shows that || f°, < || f||p. This is obtained by extending the map f +> f? 
from L(G) to L(G) by continuity to any L?(G). Z(G) will denote the set of 
central functions in L(G) and Z?(G) the set of central functions in L?(G). 


105 The general formula can be found in Dieudonné, Eléments d’analyse, chap. XXI, 
p. 106, the main problem being that notations are hard to understand. 
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On the other hand, f is clearly central if so is f. This is also the case of 
f *g if so are f and g. An easy calculation shows that central functions are 
characterized by the fact that 


f*xg=g9*f forall ge L(G). 
In L?(G), the set of operators 
L(if):g-ofeg, feZ(G) 


is, therefore, a self-adjoint commutative algebra whose elements commute 
with all left and right translations 


D(a): g-se*g, R(x): grog*ey 


as well as with all left and right convolution operators. So do the operators of 
the Gelfand-Neumark algebra Z(G) generated by these L(f), i.e. the limits 
in norm of operators L(f) + a1, where a € C and f € Z'(G), and of 
the corresponding von Neumann algebra’°° Z(G), i.e. the (weak, strong, 
ultrastrong) limits of the same operators [n° 19, (vii)]. It is not necessary to 
add scalar operators to the set of operators L(f) since using Dirac sequences 
shows that the unit operator is the ultrastrong limit!©” of operators L(f). 


As an interlude, let us show that Z(G) is in fact the set of all operators 
commuting with all right and left translations. Indeed let L(G) = £ be the 
von Neumann algebra generated by all L(x) and R(G) = R the algebra 
generated by all R(x). Clearly, £ Cc R’ and R Cc CL’. In fact, £ = R’ and 
R = L'. We will generalize this result [n° 31, (ii)] to all unimodular groups 
later. To see this, let us consider operators A € £L’ and B € R’. A commutes 
with all L(x), so with L(f) for all f € L'(G), hence for all f € L?(G) since 
G is compact. So, for f,g € L?(G) c L'(G), 


A(f *g) = AL(f)g = L(f)Ag = f * Ag 
and similarly B(f * g) = Bf * g; whence 
AB(f *g) = A(Bf * 9) = Bf * Ag = B(f * Ag) = BA(f * 9). 


These calculations are well-defined in a compact group since the convolution 
product of two functions in L?(G) is again in L?(G). It is in fact a continuous 
function. As the set of convolutions f *g is everywhere dense in L?, it follows 
that AB = BA for all A € £L’ and BE R’. Hence, using the notation of von N 
[n° 19, (vii)], 2’ C R” and R’ c L”. But by his density theorem, R” = R 
and £” = £. So 


106 Henceforth, italicized capital letters will be used for von Neumann algebras and 
bold capitals for GN algebras. 

107 By lemma 6 of n° 19, (vii), it suffices to shows that if a function g € L7(G) is a 
zero of all operators L(f), then g = 0. This is obvious thanks to Dirac sequences. 
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We can now come back to the algebra Z(G) generated by the operators 
L(f) = R(f), for central f. We need to show that every operator A commut- 
ing with all left and right translations, i.e. such that A € LOR, is in Z(G). 
The functions f € Z'(G) being characterized by the relation L(x) R(x) f = f, 
Af is central if so is f. Let us then choose a Dirac sequence consisting of mea- 
sures f,(a)dx, where the functions f,, are central. The sequence of L(f,,) con- 
verges weakly to the unit operator, so that the sequence of AL(f,,) converges 
weakly to A. But since A € R, AL(fn)g = A(fn * 9g) = Afn *g = L(Afn)g 
for all g € L?. As all Af, are central, L(Af,) € Z(G), so that A is the limit 
of operators belonging to Z(G), and so is in Z(G). Thus, indeed 


Z(G) = L(G)N RG). 


In the case of a general lcg, an algebra Z(G) generated by convolutions can 
always be defined using integrable central functions. But the only integrable 
central function on G may be the trivial one, for example if G is a semisimple 
non-compact Lie group such as SL2(R). Then Z71(G) = {0}. In this case, the 
algebra Z(G) is defined by the previous formula,'°® but it is then impossible 
to replace Z(G) by a GN algebra whose spectrum would be much more 
reasonable than the totally discontinuous one of Z(G). This is where the 
difficulty of the general case lies. 


Using Z(G) in the same way as the algebra A(G) of the commutative 
case [n° 25, (i)], we will obtain the entire theory of compact groups.1°° To 
understand the method — there are other ones -, it is useful to start with the 
proof of some formulas related to irreducible representations. 

By Schur’s lemma, the operators U(f), f € Z'(G), on any irreducible 
representation (H,U), are scalars. Setting U(f) = x(f)1, 


x(f) = dim(H)~* TU (f)] = SplU(F)]- 


This convenient notation (Spur = trace in German) restricts the legions of 
factors dim(H) that clutter most presentations. Sp(1) = 1 and as Sp is a 
linear functional on £(H), 


SpU(f)] = / SplU (@)|f (wax = / fla)x(ae)de, 


where 


108 Z(G) is the von N algebra generated by Z(G) if and only if every neighbourhood 
of e in G contains an invariant neighbourhood under inner automorphisms. Apart 
from central groups, about the only ones to have this property are the discrete 
groups. 

109 The reader is advised to interpret all the following calculations and results in the 
case where G is commutative. 
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(29.10) x(x) = Sp[U(ax)] = dim(H)~! Tr[U(x)] 


is a central function proportional to what is usually called the character 
of (H,U), namely the function Tr[U(x)]. As y(e) = 1, I will call (10) the 
normalized character of the representation considered. Obviously ¥ = x. As 
there is a relation analogous to (5) for inequivalent representations, except 
that the right hand side is then zero, putting A = B = 1 shows that the 
(standard or normalized) characters of two inequivalent representations are 
orthogonal. 
The map 


fans / Ha)x(a)de 


is a character of the normed algebra Z!(G) which, in fact, can be extended 
to Z(G) since, all irreducible representations (H,U) being realizable on!!° 
L?(G), U(f) is the restriction to H of the convolution operator L(f) by f 
whose norm on H. is bounded above by its norm on L?(G). Thus ||U(f)|| < 
\|L(f)||. Relation (10) can be generalized to all f € L'(G) in the form 


Sp[U(F)] = x(F*), 
since, by invariance of the trace, 
SplU(f)] = Sp [U(s)U(f)U (s~*)] 


for all s € G. Thus integrating with respect to s and using (8) lead to the 
formula. We also write down the relation 


x Flo) = f Sp[U (eu)] Floddy = SUV 
where f’(y) = f(y~'). If f is central, U(f’) reduces to the scalar 


slur) = | Sele WY] Fay = (bd. 
Thus 
(29.11) x*f=(flxy)x if fe Z(G) 
and in particular 
(29.11’) X*X=(KIN)X, X*X =O ify AX’. 
110 Choose a non-zero vector a in H. and associate the function 
g— (U(g *)xIa) 


to each x € H. This map transforms the operator x +— U(s)x into the left 
translation L(s). 
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We prove the second relation by first observing that, by (11), y * x’ = 
(xlx’)x = 0, then that y and x’ are orthogonal as was seen above. 

Normalized characters have a simple functional equation. For all x, the 
operator f{ U(uzu—')du commutes with the representation, and so reduces to 
the scalar 


Sp | / U (ueu-") au| = i Sola ae / SHAN e es 
Hence 
(29.12) fe (ucu~') du = Sp[U(a)]1 for all eG. 
We then deduce that 

i: ¢ (ue ty) du = Sp | / U (uu) U(y)du| = Sp[U(@)] Sew) 

i.e. that 
(29.13) [x (ucu~'y) du = x(x)x(y) - 
Setting U(«) = A, relation (12) becomes 
(29.14) / U(u)AU (u)~*du = Sp(A) 


and in fact holds for all A € £(H), since any operator A is a linear combina- 
tion of operators U(«) by von Neumann’s density theorem, which is almost 
trivial and purely algebraic in finite dimension (“Burnside’s lemma”). We 
could have also observed that the left hand side commutes with all U(x), and 
so reduces to a scalar necessarily equal to Sp[f U(u) AU (u)~*du] = Sp(A). 

We will again encounter these formula for the simple reason that, like in 
the case of a commutative group, (classes of) of irreducible representations 
of G correspond bijectively to the characters of Z(G). 


(iii) Spectral decomposition of Z(G). Let us denote the compact spectrum 
of Z(G) by X(G) and let G be the set of y € X(G) that are not identically 
zero on Z!(GQ). Since G is the complement of at least one element of X(G) — 
obviously X(G) = G if G is finite —, it is a locally compact space (in fact, 
discrete as will be seen) for which the forgotten character of Z(G) plays the 
role of the “point at infinity”. For x € X(G) and f € L1(G), 


Ix (ZF))| < JEP < FL, < tlh 


So there is (duality between L1 and L®) a bounded measurable function 
x(x) such that 
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x [ECF] = / H(a)x(a)de 


for all f € L(G). The function y may be assumed to be central — otherwise, 
replace it with x4 —, which determines it completely since then 


e915) f Fle)x(aiae = f f(e)x(v)ae = x [EU] - 


It is zero if y is the point at infinity of G. 
For all f € L1(G), we will set 


(29.16) = [| fo@xae = x(f") = (fIX)- 


For given f, we get a continuous function y +> y(f) on C tending to 0 at 
infinity by definition of the topology of X(G). The equality 


x(f *9) = x(f)x(g9) 


holds if f or g is central because of the second relation (7). 
If x is a character of Z(G), so is the conjugate function 


x(f) =x(f). 
Let us now show that, for all y € GC 
(29.17) x* x(x) = (xlx)x(@) 
(a priori almost everywhere) and more generally that 
(29.18) f* x(x) = (FIx)x(2) 


for all f € Z'(G). To this end, let us set g'(x) = g(x~') for all functions g. 
Clearly, (g * h)! = h’ * g' since G is unimodular. Moreover, f g(x)h(x)dx = 
g'*h(e) for all g, h and in particular y(f) = f’*x(e). Hence, setting y* f = 9, 
i.e. a central function like f and y, for any function h € L'(G), 


J Ma)sela)de = = ole) = Nl fe x(0) = (hl $Y *x(e) = xh F) 
= x(h)x(") = x(F") / h(x).x()de 


since f’ is central, whence y = y(f’)y = (f|x)x almost everywhere, qed. 

As the function x * x is continuous, x(a) may be supposed to be continu- 
ous, which determines it everywhere. It then satisfies (17) and (18) including 
with respect to a null measure, and as y * y(e) = (x\|x), we deduce that 


x(e) =1. 
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Exercise 1. Using the fact that 


x (fo 9") =x (f°) x (9") = x(f)x(g) 
for all f and g € L(G), show that 


(29.19) [x (uvu*y) du = x(a)x(y) - 


If x and y’ are distinct characters, (18) shows that y * x’ is proportional 
to x and x’. As two proportional characters of a normed algebra must be 
identical, 


(29.20) x*x’=0, (xlx’/) =0. 


However, for all f € Z!(G), the function (|x) is continuous on G. This is the 
case of x > (x’|x) for all x’ € G. Since this function vanishes everywhere 
except at y = y’, the subspace G = X(G) — {co} is discrete. 

The spectral decomposition 


T = | FO)dM(x), Te Z(G), 
x(G) 


of n° 22 is then particularly simple. First, for each y € G, by (22.14), the 
spectral manifold L?(G;) associated to the open set {x} C X(G) is the set 
of y € L?(G) such that 


Ty=x(T)y for all T € Z(G), 


or even only for T = L(f) where f € Z1(G). In this case, y(T) = x(f) = 
(flx) by (16), whence 


(29.21) fep=x(f)y=(fX¢e.- 


If f = x’ € G, then y(f) = f y/(x)x(a)dx = 0 except if x’ = YX, in which 
case x(f) = (x]x). So 


(29.21’) X*~ = (xIx)¥ 
in L?(G; x). Conversely, (21’) implies (21) since 
(xd fxe= fae) =(f*x) *9 = (FIX * 4 = x(x 
by (18) for x. Hence 
(29.22) ge L(G;x) = X*9=(lxIe- 


Given (17) for x, X € L7(G;x). 
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According to the general results of n° 22, (iii), the spectral manifold as- 
sociated to the open and discrete subset G = X(G) — {oo} of X(G), is the 
Hilbert direct sum of the spaces L?(G; x) for all y € G. Therefore, like in any 
GN algebra, the spectral manifold associated to X(G) = Gu {co}, namely 
L?(G) is the direct sum of the spaces L?(G; x) and of the spectral manifold 
associated to {oo}. It is trivial since its elements satisfy L(f)y = 0 for all 
f € Z(G), whence y = 0 by (9). So the spectral manifold associated to G is 
L?(G), and so 


(29.23) P(G)=OVr(eG»), 


xEG 


which is a Hilbert direct sum. 

Let us compute E(x), the orthogonal projection operator onto L?(G; x). 
The operator f —+ yx f is Hermitian and proportional to its square by (17). 
By (22), it reduces to the scalar (x|y) on L?(G; x) and it obviously vanishes 
on the other spectral manifolds L?(G; x’). So it is proportional to E(y), and 
as E(x) needs to equal 1 on L?(G; x), 


(29.24) E(x)f = (nlx) -x*f 


necessarily holds for all f € L?(G). 
Permuting y and Y in (23), for all f € L7(G), 


(29.25) fa Ob aes 


i.e. a series of pairwise orthogonal functions, and 


(29.25’) (f19) = So Od) 7c * Fx * 9) 


for all f and g. Summation is over x € G, series (25’) converging uncondi- 
tionally (Chap. II, n° 15). 


(iv) Characters of Z(G) and irreducible representations. As operators T € 
Z(G) commute with all the right and left translations, the spaces L?(G; x) 
are bi-invariant. Let us show that, the right and left translation operators on 
L?(G; x) define an irreducible representation of G x G. 

Indeed, let A be an operator on L?(G; x) commuting with these trans- 
lations. Extend it to L?(G) by requiring it to be zero on the other spaces 
L?(G;y’). This gives an operator T belonging to the von N algebra L(G) 
R(G) = Z(G) generated by Z(G) [end of section (ii)]. As any T in Z(G), or in 
Z(G), reduces to a scalar on L?(G; y), so does A, which implies irreducibility. 

G x G being compact, theorem 42 shows that the subspaces L?(G; x) are 
finite-dimensional. Then let # be a minimal non-trivial left invariant sub- 
space of L?(G; y). Denoting by U(x) : ¢ -> €, * the restriction of L(x) to 
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H, the representation (H,U) of G is irreducible. For all f € Z'(G), the op- 
erator U(f) = f{ U(x) f(ax)dz, which is the restriction of L(f) = L(x) f(x)dx 
to H, reduces to a scalar, namely 


Af) = SeU(A)] = i f(a)Ma)de, 
where 
A(x) = SplU(2)] 


is the normalized character of the representation (H, U) defined in section (ii). 
As every T € Z(G) reduces to the scalar y(T) on L?(G; yx) and so perforce 


in H, A(f) = x{L(f)] = Jf f(x)x(«)de. Thus 
(29.26) x(x) = Sp[U (x) 


for any irreducible component (H,U) of the left regular representation of G 
on L?(G;x). Hence the algebra Z(G) only has characters defining the irre- 
ducible representations of G (and the character “at infinity”). Exercise 1 was 
therefore superfluous since we already knew (13). 

We can now reconstruct L?(G; x) using (H,U). To begin with, by (24), 
L?(G; x) is the set of functions of the form f *¥. However, for all f € L?(G), 


fex(2) = i fley)x(y)dy = ‘. flay) SplU (y)]dy 
and so 


(29.27) f*X(a) = SpU (a) Uf). 


As every A € L(H) is in some U(f) because of irreducibility, it follows that 
L?(G; x) is the set of functions of the form 


xr Sp [AU (x)~*] ; 


The preceding calculations assume that a minimal left invariant subspace 
H has been chosen, but changing 1 replaces the representation (H,U) by an 
equivalent representation since, as was seen above, the characters of two in- 
equivalent irreducible representations are orthogonal. In conclusion, the sub- 
spaces L?(G; x) correspond bijectively to irreducible representations of G. 
The equality 


(fla) = So Oxdx) 7x * fx *9)- 


can now be written differently. Calculations leading to (27) also shows that 


f*x(@) = Sp[U(@)U(f)) , 
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where f’(x) = f(a~+). So, using the orthogonality relations in the form (5), 


(x* fhe a) = f SU UE) OCT OTae 
= dim(H)~* Sp[U(f)U(9')"] = (xix) SplUF Ug’) 
because (5) for A = B = 1 also shows that 
(xlx) = dim(H)~?. 
Hence 
(xl 7 0c * fix * 9) = Ol)? Sp[U(F)U(9')"] 


= dim(H)? Sp[U(F)U(')") 
= dim(H) TrU(f)U(G')"I - 


This gives Plancherel’s formula (it is better to say Peter-Weyl’s, even if they 
argued in terms of matrices instead of operators) 


(29.28) (flg) = S¢ dim(H) Tr[U(f’)U(9')"] , 


where the summation is over the irreducible classes of representations. Be- 
sides, as (f’|g’) = (flg), f’ and g’ can also be replaced by f and g on the 
right hand side. 

To obtain a formula which, for a commutative G, reduces to those of the 
previous §, for each vy € G, we could choose an irreducible representation 


xr U(x; x) 


with character x in the space H(x) and set 


(29.29) fo = / U(w;x)*f(a)de = US's). 


For every f € L'(G); this Fourier transform of f, a function whose values 
are operators, reduces for central f to the scalar(f|y). Denoting by dim(y) 
the dimension of H(y), (28) becomes 


(29.28") (fg) = So dima) Tr] F00G00)"] 


for f,g € L?(G). This immediately gives formulas 


n~ 


(29.30) g(a) = flaaxb) => G(x) = U(b, x) FOQU(a, x) ; 


n~ 


(29.31) (F* 9) (x) =GO0F(0)- 
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Exercise 2. Prove an analogue of the Fourier inversion formula. 

Exercise 8. Consider the von Neumann algebra Z(G). Every T € Z(G) 
acts on each subspace L?(G; x) by a scalar T(x). Show that this gives all the 
functions such that 


sup |T(x)| < +00. 


To conclude, let us show how a unitary representation (H,U) of G can be de- 


composed.!" Since, by (17), the function (y|y)~!y is Hermitian and identical 
to its convolution square, the operator 


(29.32) E(x) = (lx)? } U(e)x(@de = (x/x) 1) 


is a projection onto a subspace H(y). Since y x x’ = 0 if xy 4 yx’, these 
subspaces are pairwise orthogonal, and invariant since the characters are 
central functions. For a,b € H, age-old calculations give 


(xl? f (Uley)alb) x(y)dy = (U (a) E(x)alb) = (U(x) E(x)alE(x)b) - 
If a € H(y), the coefficient 


(29.33) (U(a)a|b) = Yap(x) 


of (H,U) thus satisfies (x|y)~t.v * Ya,b = Ya.b, and so is in L?(G; x). There- 
fore, as b varies in H,, functions (33) remain in a finite-dimensional subspace 
of L?(G), in fact of L(G). It follows that the subspace H(a) of H generated 
by all U(z)a is also finite-dimensional. As it is invariant, it is the direct sum 
of minimal non-trivial invariant subspaces on each of which the operators 
U(x) act irreducibly. The representations thus obtained on H(a) all belong 
to the class y considered since there operators (32) are equal to 1. As for the 
subspace H(y), it can be infinite-dimensional, but as the orthogonal comple- 
ment of some H(a) in H(x), it is also invariant. H(y) is the Hilbert direct sum 
(in several ways!) of subspaces H(a), hence of finite-dimensional irreducible 
subspaces, each of which correspond to a representation with character x. 

It remains to be shown that H is a Hilbert direct sum of the subspaces 
H(x). This is an exercise whose details can surely be omitted at this point 
of the presentation. 


(v) Easy generalizations. The reader, having probably observed similari- 
ties between the methods used for commutative groups and compact groups, 
will suspect that it should be possible to address both cases simultaneously. 
There are three ways at our disposal to achieve this. I will only briefly outline 
them. 


‘11 Apart from orthogonality questions, the following arguments apply to all contin- 
uous representations in a Banach space. 
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(a) The obvious method!” consists in generalizing calculations and results 
to central groups (G/Z compact) and, for every character a of the centre Z 
of G, in replacing L?(G) by the space L?(G/Z; a) of mod Z square integrable 
solutions of f (xz) = f(x)a(z). There is still a map f +> f* from L!(G) onto 
its centre Z1(G), given by 


fX(a) = Lf (uzu-") du. 


The convolution products by f € Z(G) obviously preserve the space 
L?(G/Z; a) on which an ersatz of the convolution can be defined by setting 


f *z 9(2) =f, fev) ody. 


In this space, (ordinary) convolutions by f € Z!(G) generate an algebra 
Z(G/Z;a) with the same properties as the algebra Z(G) of the compact 
case. As above, one can show that the characters of this algebra are the 
functions x(x) = Sp[U(x)] of the irreducible representations (H,U) on which 
the centre Z de G acts by a. Denoting by L?(G;) the subspace of functions 
Tr[AU (x)~*], where A € L(H), L?(G/Z;a) is shown to be a Hilbert direct 
sum of these subspaces. If, for any character x, (U(x; x),H(x)) denotes a 
representation with normalized character y(x) = Sp[U(a; x), the projection 
of f € L?(G/Z;a) onto L?(G; x) is the function 


++ dim(x) L, Tr [U(xy~*;x)] f(y)dy. 


On the other hand, associating the function 


7 / HOO ROLE 


to each f € L(G) gives functions f, everywhere dense in L?(G;a). The 
projection of f, onto L?(G; x) is the function 


dim(x) Joyz dy Jz Tr [U(xy~ fe es F(ye- a (z )d a= 


= dim(x) f dy f He pay x hk (yz)dz = 
= dim(x) te Tr [U( Ey )| FC y)dy , 


ie. setting 
Fo) = f Ulex)" H@)de 


"2 §. Grosser and M. Moskowitz, Harmonic analysis on central topological groups 
(Trans. AMS, 156, 1971, pp. 419-454). Also Calvin Moore, Groups with finite- 
dimensional irreducible representations (idem, 166, 1972, pp. 401-410). 
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n~ 


as in the compact case, this is the function dim(x) Tr[U(x; x) f(x)]. There is 
again an expansion 


(29.34) (folga)= D> dim(x) Tr Fod900"| 


xXx=a sur Z 


for f,g € L(G). But Plancherel’s formula for Z shows that the inner product 
(f|g) on L?(G) is given by 


(fla) = fL if [ sesiateajae = f x [tole aat2)aa, 


i.e. by 


(29.35) (fig) = i (falga)der. 


Hence, given (34), 


(fla) = I da Sy dim(x) TLIO"). 


x=a sur Z 


This can be simplified further. In the locally compact space G of the char- 
acters of Z1(G), the relation {y = x’ on Z} is an equivalence relation whose 
classes are discrete sets. The quotient space can be canonically identified to 
the dual Z of Z. A measure dx can then be defined on G by setting 


for all f € L(G). This is a very simple example of a continuous sum of discrete 
measures (n° 13). This definition leads to the final formula 


9.36) f s(wjat@iae= ft [Foaoy"] aimoyar. 

It unifies the results obtained for commutative groups and compact groups. 
Though the subject is not of much use, proving all these formulas without 
having recourse to we already know about commutative or compact groups 
would be one of the best exercises for the reader to familiarize himself with 
these methods. 


(b) A significantly more difficult generalization!!? as it uses G. W. Mackey’s 
theory of induced representations consists in supposing that G contains a nor- 
mal commutative closed subgroup (but not necessarily central) Z such that 
G/Z is compact. 


113 Kleppner and R. L. Lipsman, Plancherel formula for group extensions (Ann. sc. 
Ecole normale sup., 5 (1972) and 6 (1973)). 
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(c) The third and an extremely ingenious and much more useful method 
is Gelfand’s theory of spherical functions. Suppose we are given a compact 
subgroup K in G such that the convolution product is commutative on the 
set of functions such that f(uav) = f(x) for all u,v © K. A trivial example: 
G = kK x Kk, the compact subgroup being the diagonal of the product; the 
solutions of f(uxv,uyv) = f(x,y) can be identified to the central functions 
on Kk. A far less trivial example: suppose there is an “involution” 2+> 2’ in 
G satisfying 

(c=, (sy)! =y'e’, wv =a forallaeckK, 

and that all « € G can be written as « = kp with k € K and p’ = p. 
For G = GL,,(R), K is the orthogonal group and 2’ the transpose of the 
matrix 2. The situation is the same in every real semisimple Lie groups.'!!4 
It then only requires one line of calculations to check that the convolution 
product, restricted to K-bi-invariant functions, is commutative for the simple 
reason that we then have f(z’) = f(x). In all cases, the set of integrable K-bi- 
invariant functions is again a commutative normed algebra whose characters, 
in other words the functions said to be spherical, are the (necessarily bi- 
invariant) solutions of the functional equation 


/ Haky)dk = f(0) f(y). 


There is still a Plancherel’s formula not for all of L?(G) — it would be mirac- 
ulous — but, for want of anything better, for the subspace of L?(G) generated 
by the images of bi-invariant functions under right and left translations. Only 
irreducible unitary representations of G admitting a K-invariant vector are 
involved in this formula. This vector is necessarily unique up to a scalar. This 
method was adopted by Dieudonné in his volume 6, and I could have done 
the same.'! Regardless of its merit, it seemed more reasonable to me not to 
throw into disarray readers who are not expected to already know everything. 
In addition, the real problems — explicit calculations of irreducible spherical 
functions using integral formulas, explicit calculations of Plancherel’s formula 
— cannot be solved by this type of abstract nonsense, however ingenious it 


114 As a first approximation, these are the closed subgroups of GL,,(R) satisfying 
the following conditions: (i) G is algebraic, i.e. defined by polynomial equations, 
(ii) Every commutative closed normal subgroup of G is finite. Examples: SZ,,(R), 
the group of automorphisms of symmetric or alternate bilinear forms (orthogonal 
or symplectic groups), the subgroups of SLn(C) leaving a symmetric bilinear 
form or a Hermitian form invariant, etc. There are also “exceptional groups” that 
cannot be easily defined. .. General semisimple groups are coverings of semisimple 
linear groups. The most simple general definition uses the Lie algebra of G. 

I presented it at the Séminaire Bourbaki (1958) adding to Gelfand’s short note 
in Doklady a Plancherel formula which did not appear there. My talk, or that of 
Serge Lang, SZ2(R) (Springer, 1985), Chap. IV, is probably easier to read than 
that of Dieudonné’s. . . 


11 


ou 
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may be. They have been completely solved by Harish-Chandrat!® in arti- 
cles that display his genius, a term that should not be used lightly, and his 
prodigious obstinacy. 

We are now going to drop the subject of compact groups and briefly 
outline possible (or impossible) extensions to general Icgs.!1” 


30 — Measures and Functions of Positive Type 


(i) Measures of positive type. Let G be a locally compact group. As was seen 
in n° 25, the inner product on L?(G) is given by 


(30.1) (flg) =e@*f) , 
where € is the Diract measure at e. It follows that the map 
(30.2) (f,9) > €(9* f) 


is a positive Hermitian form on L(G), and one obviously obtains L?(G) by 
applying the general method of n° 19 to it . 

It is thus necessary to more generally consider measures yp on G' having 
this property, namely measures of positive type (notation: w >> 0). Such a 
measure enables us to define a unitary representation of G, which, for uw = e, 
is just the “left” regular representation of G on L?(G). 

Indeed, if p: >> 0, then taking the quotient and the completion, the general 
method of n° 19, (i) leads to a Hilbert space H(j) and to a canonical map 
f > f, from L(G) onto an everywhere dense subspace of H(y). By definition, 


(30.3) (fulgu) =H (G* f) = | flyx)g(y)du(x)dy 


follows from (28.3’). If f and g are made to undergo the same left translation, 
the function g * f does not change. So, if f,, = 0, neither do any of images 
of f under left translations. Hence, for each x € G there is a linear operator 
L,,(x) defined on the subspace of f,, by 


(30.4) 9 = x * f = Gu = Lyl(2)fu- 


It is bijective, preserves the inner product and can be extended to all of H (1). 
This gives a unitary representation of G on H(). To check that the maps 


16 About Harish-Chandra, the “second Ramanujan” as he is now called in India, see 
the articles by Borel, Helgason, Langlands and Varadarajan in Robert S. Doran 
& V.S. Varadarajan, eds., The Mathematical Legacy of Harish-Chandra (Amer. 
Math. Soc. 2000). India has produced other first-class mathematicians, but only 
rarely does one meet there or elsewhere Harish’s level. 

7 For the rest of this chapter, it will be useful to refer to the second part of 
Dixmier’s more complete presentation, C*-algebras (North-Holland, 1972), which 
has the inconvenience or advantage, depending on one’s point of view, of being 
based on the general theory of GN algebras. See also Nolan R. Wallach, Real 
Reductive Groups II (Academic Press, 1992), Chap. 14. 
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x ++ L,,(az)a are continuous for all a € H(z), it suffices to do so for all a 
everywhere dense, for example, for f,,, where f € L(G), and only at « =e. 
As 


|Lu(2) fu — full? =h[(eo* f—-f) *(ee*f—f)], 


it suffices to check that x4 p(f * €, * f) is continuous. 
Every f € L(G) defines an operator 


(30.5) L,(f) = ‘3 L (a) f(a)de 


on H(,s). By definition, 


(Ly(f)gelhu) = f Cu (eoulhy) fa de = 1 (fix ee *9) fle)de 
=p(he fg) =((F*9)plly) 
[n° 25, (ii), exercise 1]. We then deduce that 
(30.6) h=feg—h=L(f)ou 


for f,g,h € L(G). 

This result could be interpreted by first observing that, by Cauchy- 
Schwarz, in the algebra L(G), the relation p(f * f) = 0 is equivalent to 
u(g * f) =0 for all g € L(G), whence p(f *g*g * f) = 0. The set of f such 
that f,, = 0 is, therefore, a left ideal I(y) of L(G). But if J is a left ideal of 
a ring A, A can be made to act on the quotient A/J since, for f,g € A, the 
coset of f * g mod I only depends on the coset of g. Formula (6) falls within 
this algebraic framework. 

The Hermitian symmetry of u(g * f) shows that 


u(g* f) =u(f *9)- 


Making the measure g(x)dx converge to the Dirac measure at e gives p(f) = 


u(f). Thus 


(30.7) fi=p if po. 


When G is a Lie group,!!® one can consider C® functions on G, and 
thus define the Schwartz space D(G) of C™ functions with compact support, 
hence also distributions on G by generalizing in an obvious manner what has 
been said in Chap. VII for R. For f,g € D(G), f*g € D(G), which enables us 
to define distributions of positive type by requiring that y(g* f) be a positive 


118 je. equipped with a C® manifold structure such that the map (x,y) > xy’ is 
Ccé. 
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Hermitian form on D(G). The construction of the space H(j:) and of the 
unitary representation «++ L,,(x) can be generalized to this case. 


(ii) Case of a commutative group. Since the representation defined by ¢ 
leads to Plancherel’s formula, one expects a similar result for all measures 
> 0, namely: 


Theorem 44. Let G be a commutative lcg and a measure of positive type 
on G. Then there is a unique positive measure ji on G such that 


(30.8) u(f*g) = if FOTOOEACY) 


for all f,g € L(G). 


The proof is almost the same as that of theorem 40. Consider the set 
A(j) in H() of limits in norm of operators of the form L,,(f) + A1; it is a 
commutative GN algebra and the proof reduces to setting out explicitly its 
spectral decomposition. To start with, as 


Zu(A)Il S Wf lh, 


any character of A(j) defines a (possibly trivial) character of L1(G), which 
gives a homeomorphism from the spectrum A(j:) onto a compact subspace 
X,,(G) of X(G) = G or GU {oo}. The Gelfand transform T + T for A(j1) as- 
sociates the restrictions to X,,(G) of the Fourier transforms flx ) to operators 
L,(f). Hence by the general theory of n° 22, (ii) et (iii), for f,p,q € L(G), 


(Eulf)Pplau) = / FOO ating) 


where j/ip,q is, in theory, a measure on the compact set X,,(G), and so can 
be identified to a measure on X(G) concentrated on X,,(G). Any bounded 
Borel function y on X(G) defines an operator M(y) on Up ) such that 


(M(e)pula,) = / COO dbgad: 


As was seen using formula (27.18) in the case of the regular representation, 
{oo} is a null set with respect to the measures [tp.g, which, therefore, become 


bounded on G. 
Then, replacing y by fa, where f,g € L(G), in the previous formula, 
| Fooa I(x) UUp, a(X) = (Ly(g)* Lu (Pf) Pulau) 


= 1 (q* 9% f*p)=U(G*qd*p* f) 


if BOT Odds g(x) - 
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We then deduce that 


Fd ICO dtp,a(x) = PODTD dtey,g(x) 


for all f,g,p,q € L(G). The end of the proof is now obvious. 

This theorem shows that f € L? (G; i) for all f € L(G) and that the map 
fro f can be extended (modulo taking the quotient) to an isometric map of 
H(1) in L?(G; 7). As in the classical case (ju = €), it is bijective (exercise !). 

By definition, the measure ji is the Fourier transform of jy. Plancherel’s 
traditional formula thus says that the Fourier transform of the measure e€ is 
an invariant measure of G. 

If G is of the form R? x T?, in which case G = RP? x Z4, the arguments of 
theorem 44 associate a measure {i > 0 on G to each distribution pb > OonG, 
with formula (8) holding for f,g € D(G). We leave it to the reader to fill in 
the gaps in this proof if he feels the need to do so. The space S(G) of rapidly 
decreasing C® functions on G can also be defined over such a group, i.e. 
functions for which the product of any derivative and a polynomial function 
tends to 0 at infinity. Hence the notion of tempered distributions on G as in 
Chap. VII. If such a distribution py is of positive type, D(G) can be replaced 
by S(G) in the previous arguments and formula (8) thereby obtained for all 
f,g € S(G). It follows that the Fourier transform of a tempered distribution 
of positive type is a positive tempered measure, which means that there is a 
polynomial function p on G such that the measure p(y)dji(x) is bounded. 

Measures ji obtained from distributions of positive type are not the most 
general positive measures on G. To obtain these, D(G) or S(G) would need to 
be replaced by the algebra (with respect to the convolution product) W(G) of 
functions whose Fourier transform has compact support (for G = R, see the 
Paley-Wiener theorem in Chap. VIII, n° 12). It is then possible to associate 
to any measure v > 0 on G a linear functional  : f + v(f) on W(G) for 
which 


u(f * f) =v (If?) > 0. 


In fact all this falls within the scope of a general theorem which Dieudonné 
(XV.9) strangely called “Plancherel-Godement” as he had heard me present 
it around 1950. As it was an almost trivial generalization of the method 
used by Cartan and myself to present the Fourier transform on commutative 
groups, I did not publish the result. 

Exercise 1. Suppose that du(x) = f(x)dx for some f € L?(G). Show that 


n~ n~ 


dii(x) = f(x)dy and that f(x) > 0 almost everywhere. 
(iii) Functions of positive type. Among measures of positive type on a lcg, 


there are measures duu(x) = y(x)dx, where ¢ is locally integrable. y is then 
said to be of positive type. However, by (6), 


uG*f)= / flye)g(g)o(a)dardy = / v (yt) f(a)g@)dedy. 
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Hence the condition 
(30.9) fe (yz) f(x) f (y)dady >0 


valid for all f € L(G). As will be seen, a general procedure for obtaining 
continuous functions of positive type consists in starting from a unitary rep- 
resentation (H,U) of G and in considering a diagonal coefficient 


(30.10) p(x) = (U(z)ala), acH, 


of the latter. Then indeed, for du(x) = y(x)dza, 


whence 


(30.11) ug * f) = (U(fyalU (g)a). 


As the left hand side is, by definition, (f,,|9,), the representation of G de- 
fined by y(x)dx can be obtained up to equivalence in the following manner: 
H(1) is the closure of the subspace of U(f)a € H, operators L,,(x) being 
the restrictions of U(x) to this subspace. In particular, if the representation 
(H, U) is irreducible, it is equivalent to the representation defined by p. 


Lemma 1 (Gelfand-Raikov). Every locally integrable function y of posi- 
tive type bounded in the neighbourhood of unity is almost everywhere equal 
to a bounded continuous function. There is a unitary representation (H,U) of 
G such that p(x) = (U(ax)ala) for somea € H. It is unique up to equivalence 
if the linear combinations of the U(x)a are required to be everywhere dense 
in H. 


Here too, the result is obtained by using the unitary representation defined 
by du(a) = y(a)dx. Indeed, for f,g € L(G), 


(30.12) (fulg.) =n(G*f) = ff 2 yn) f(e)a(yaedy. 


By assumption, |y(x)| < M ae. in a neighbourhood of 0 which can be sup- 
posed to be of the form K~!K, where K is a compact neighbourhood of e. 
If f vanishes outside K and if we put g = f in (12), we get the inequality 


Wal <M [ |f(@)lae. 


Then, choosing functions f > 0 with integral equal to 1 and vanishing outside 
K, we suppose that the measure f(2)dx converges to a Dirac measure at e. 
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As the function 
fe (y~*a) g(y)dy = fe (y~*) g(ay)dy = 9 * 9(2) 


is continuous for g € L(G), (fy\g,) converges to the value of this function at 
x =e. Since ||f,,|| <M for the functions f considered, at the limit, 


[fel ye) Taha <M |\g,|| for all g € L(G). 


So there is a vector a € H(t) such that 


(algu) = fe (y~") g(y)dy 


for all g € L(G) and, replacing g(y) by g(xy), 
but then (12) becomes 


whence f,, = L,,(f)a for all f € L(G). So 


i few \aw)drdy = (Fuldu) = (Ly (f)alLy (ga) 


=} | eal x)a|L,,(y)a) f(«)g(y)dady 


for all f,g € L(G). We then deduce that 


g (y~'x) = (Ly(a)alL£,(y)a) = (Ly(y~*x)ala) 


almost everywhere on G x G. Thus 


p(a) = (Ly(x)ala) 


for almost all x. 

Conversely, if p(x) = (U(x)ala) for a unitary representation (H,U) and 
some a € H, H may be assumed to be generated by all U(x)a. Then, as was 
seen above, for the measure d(x) = y(x)dx and f,g € L(G), 


(30.18) (U(f)alU(g)a) = H(9* f) = (fulgy) - 


Therefore, there is an isomorphism from H(j) onto H which, for all f € L(G), 
maps f,, onto U(f)a. So the representation defined by z is equivalent to 
(H,U), which proves the lemma. 
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In particular if (H,U) and (H’,U’) are two unitary representations ad- 
mitting “generators” a and a’, and if (U(x)ala) = (U'(ax)a’|a’) for all x, then 
all given representations are equivalent. Corollary: two irreducible represen- 
tations having a common diagonal coefficient are equivalent. 

Supposing that y is continuous, it is clear that y(e) > 0, |y(a)| < v(e) 
and 


y (27) = g(a) 


for all x. Hence G(x) = A(x)y(z), ie. 6 = y when G unimodular. 
On the other hand, under the same assumptions, relation (13) shows that 
if a measure v of positive type is < p, then v(f * f) < ||U(f)al|? and so 


Iv (9* f)| < ]U(A/all lO (gall . 


The U(f)a being everywhere dense in 1, there is (n° 19, corollary of theorem 
31) a Hermitian operator H satisfying 0 < H <1 and 


v(g* f) = (AU(f)alU(g)a) . 


The left hand side being invariant if f and g are made to undergo the same 
left translation, H commutes with all U(f) and all U(x). Hence if (H,U) is 
irreducible, then H is a scalar and v is proportional to p. 

Conversely suppose that this condition holds for a measure p of the form 
y(«)dx, where y is continuous of positive type. If a Hermitian operator H 
on H, satisfying 0 < H < 1, commutes with all U(x), so does H'/? and the 
function 


vx (2) = (U(x) Hala) = (U(w) HY? Hala) = (U(w)HYalH"/?a) 


is of positive type, as well as yi_n = y— yu. Therefore, gy < y, and so 
yu = cy for some scalar c > 0. It readily follows that H is a scalar. 

Let us say that a continuous function of positive type is elementary if 
it is a “diagonal” coefficient of an irreducible unitary representation. The 
previous arguments prove the next statement: 


Lemma 2. Let y be a continuous function of positive type. py is elementary 
if and only if every continuous function of positive type w < vp is proportional 
to yp. Every measure js of positive type such that u< y is then proportional 
to the measure yp(x)dax. 


An easy calculation shows that, for any finite number of x7; € G and 
scalars €; € C, in notation (10), 


(30.14) wr, (ay) bib; a Scaw (xi) all 20. 


This is the original definition of (continuous) functions of positive type used 
in the 1930s by Salomon Bochner for G = R, and adopted by everyone, 
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in particular by probabilists studying stationary stochastic processes. The 
equivalence of (14) and (9) can be easily shown. Indeed, if K is a compact 
set outside which the continuous functions f and g appearing in (9) vanish, 
then K can be partitioned into finitely many measurable sets A; on which f 
and g are constant up to r > 0. Choosing x; in A; and denoting the extension 
of the integral of f (resp. g) to A; by &; (resp. 7), ¢ being continuous, the 
difference between integral (9) and the sum > (a; *xa)&in, can clearly be 
made to be arbitrarily small. Hence the result. 

For G = R, F. Riesz!!® was the first to see the relations between con- 
tinuous functions of positive type and unitary representations of R studied 
by Marshall Stone. This is where Gelfand and Raikov (1943) probably got 
the idea of generalizing the result to arbitrary lcg; this was my case a year 
later. This point of view cannot be found in Weil’s book, but it contains the 
generalization of Bochner’s essential result to all commutative Icgs: 


Theorem 45 (S. Bochner). A continuous function y on a commutative 
leg G is of positive type if and only if there is a bounded positive measure!?0 
d¢(x) on G such that 


(30.15) p(2) = [ele 04800 for all EG. 


The measure © is unique. 


Indeed, applying theorem 44 to the measure du(a) = y(x)dx, we get a 
measure ji > 0 on G such that 


| Fooa I(x) aji(x) = wf * 9) = fo (y-*x) f(x)g(y)dady 


for all f,g € L(G). To show that 7 is bounded, we put f = g and make the 
measure f(x)dx tend to the Dirac measure at e. The second integral tends 
to y(e) since y is continuous and bounded. On the left hand side, FOO? 
converges uniformly to 1 on every compact set while staying dominated by 
a constant. So the left hand side tends to ji(1). Thus fi(1) = y(e) < +00. 
To get (15), we make g(y)dy tend to a Dirac measure at e and f(x)dx to 
a Dirac measure at a point a € G. f(x) converges to e(a, x) and G(x) to 
1, while staying dominated by a constant function. This enables us to pass 
to the limit in the first integral. In the middle expression, the limit is y(a), 
giving (15). The converse consists in calculating u(f * g) from (15). 


Exercise 2. State the theorem explicitly for G = Z (The G. Herglotz 
“moment problem”) and G = T. 


19 Uber Séitze von Stone und Bochner (Acta Szeged, 1933, pp. 184-198) 
120 So the notation G does not denote a function, though, in some cases, the measure 
d(x) is absolutely continuous with respect to dy. 
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Exercise 8. (a) Show that the Fourier transform (in the sense of Plancherel’s 
formula) of a square integrablecontinuous function y of positive type is posi- 
tive, in L1?(G) and that 


y(e) = i ela, )O(x)ax. 


(b) Show that there is a unique function w € L?(G) of positive type such that 
p=wx«w. (c) Let (H,U) be the unitary representation defined by y (The 
Gelfand-Raikov lemma). Show that it is equivalent to the representation on 
Le (G: ®) associating the operator of multiplication by e(x; x) to each x € G. 
(d) Denote by H(~) the smallest closed invariant subspace of L?(G) contain- 
ing w. Show that (H,U) is equivalent to the representation by translations 
on H(w). 

Exercise 4. Let (H,U) be a unitary representation of G. Using Bochner’s 
theorem, show there is spectral measure M on GC in the sense of n° 22, such 
that?" 


U(x) = / e(r, x)dM (x) 


n 


for all x € G and U(f) = M(f) for all f € L'(G). What about the case 
G=Z2? 

Exercise 5. (a) Let G be a lcg and K Cc L™(G) the convex set of contin- 
uous functions of positive type such that y(e) < 1. Show that K is compact 
with respect to the weak topology of L°(G) (use lemma 1 and the fact that, 
in the topological dual of a Banach space, the unit ball is weakly compact). 
(b) A point of a convex set K is said to be extremal if it is not in the in- 
terior of any line segment contained in kK. Using lemma 2, show that the 
extremal points of K are precisely the elementary y such that y(e) = 1 and 
the function 0. 

In a locally convex topological vector space, for example L™(G) equipped 
with the weak topology, every compact convex set is the smallest closed convex 
set containing all its extremal points (Krein-Milman theorem). The previous 
exercise then enables us to show the existence of “many” elementary func- 
tions of positive type and even that, for all  € G such that x # e, there 


121 For G = R, it is Stone’s theorem referred to in the title of F. Riesz’s article 
mentioned above. It extension to the general case was independently published 
by M. A. Neumark (1943), W. Ambrose (1944) and myself (CR de l’Académie 
des Sciences, June 1944) at a time when, as I have already mentioned some- 
where, Soviet (or American) journals did not reach France. Mine is even now 
not cited, perhaps because it was published in CRAS, a journal little read by 
mathematicians. Some have criticized CRAS for not having referees. As most 
mathematicians are familiar with the distinction between a necessary and a suf- 
ficient condition, this argument does not prove anything: to decide whether a 
two page article is interesting or not, it should be read. Let me add that my note 
already contained much of the formalism of spectral measures given in n° 22. 
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is an irreducible unitary representation (H,U) of G such that U(x) 4 1: 
This is Gelfand-Raikov’s result and their proof. To go further, two methods 
are available. The general theorem for G can be used. Choquet’s theorem for 
compact convex: for all y € K such that y(e) = 1, there is a positive measure 
m on K, of total weight 1, concentrated on the set of elementary functions 
and such that 


p(x) = [ewmauc) for all x. 


This result generalizes Bochner’s theorem, except for the uniqueness of the 
measure ps. Otherwise, using von Neumann’s methods described in n° 24, 
one can show that any unitary representation of G is a continuous sum of 
irreducible representations. But this result from pure measure theory is of 
little interest. In concrete cases, one always expects to find a much more 
natural or canonical decomposition, not to mention the fact that, for some 
bizarre groups, a given representation can be decomposed into irreducible 
representations in two completely different ways. 


31 — Quasi-Regular Representations of a Unimodular Group 


(i) Central measures of positive type.?? As was seen in section (i) above, the 
regular representation of G by left translations 


(31.1) L(x): froex*f 


on L?(G) is obtained by applying the method valid for any measure pz > 0 
to the Diract measure ¢. But right translations defined by 


(31.1) Ria): fo f * es 


define a second representation on L?(G)!?3 commuting with the previous one. 
It is unitary because, G being unimodular, the measure € satisfies the identity 


(31.2) e(f *g) =e(g* f), 


equivalent to 


[fos (2~*) de = f o(2)f (2) der. 


122 For this section, see my Mémoire sur la théorie des caractéres dans les groupes 
localement compacts unimodulaires (Journal de math. pures et appliquées, XXX, 
1951, pp. 1-110). 

23 As fxex is the function y 4 f(y"), R(xy) = R(y)R(2), so that we get a repre- 
sentation of the “opposite” group, obtained by replacing the given composition 
law (2, y) + xy by (x,y) 4 yx. The map «+> R(a~') is the one that would 
define a representation of G. Denoting by G’ the “opposite” group to G, the 
map (z,y) + L(a)R(y) is, therefore, a— very particular — unitary representation 
of the group G x G’. 
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Then 
(R(x) flg) = €(G* f * ex) = € (ex *9* f) 
=e(g*ee1) *f]=(fIR(@"") 9) , 
whence the result. Moreover, L?(G) contains an “involution” 
Sifrof 
for which 
(31.3) SLs = Rae =): 


(2) says that the measure € is central. An immediate calculation shows that 
central measure are invariant under inner automorphisms x ++ sxvs~! de G, 
and conversely. 

Similarly, for any central measure pu of positive type, there are two unitary 
representations L,, and R,, and an involution on the space H(j1). Indeed the 
inner product (f,.{9,) = “(9 * f) is invariant under left and right translations 
frre,» f and fH f *€,; hence two representations L,,(x) and R,,(x). They 
commute like they do on L?(G). For the same reason, 


u(g*fy=u(f*g) (car p>0)=pn(g* f) =n(f*9), 


which proves that f f induces a quotient map and defines an involution S$ 
on H(,) such that 


(31.4) 9=F > = Sf, 


Relation (3) obviously holds in this setup. 

Let us denote by A(j) the image of L(G) under f + f,,. It is the quotient 
of L(G) by the left ideal I(:) consisting of f such that p(f * f) = 0. As pu 
is central, this ideal is stable under f ++ f, and so is a two-sided ideal!?4 
of L(G). This provides an algebra structure on A(j) = L(G)/I(). Hence, 
denoting multiplication in A(j) by a point to avoid any confusion, 


(31.5) fu-Gu = (f * 9) p> 
(31.6) S (fu-9u) = Sou Shu - 


Except in some cases, in general the f,, cannot be interpreted as functions 
on G. The symbol used to multiply them is, therefore, not a convolution 
product in the usual sense. 

Over a Lie group, L(G) could be replaced by the Schwartz space D(G) 
and the same constructions carried out for all central distributions of posi- 
tive type. We could even go further by considering positive Hermitian 


124 Tn a ring A, a two-sided ideal is a subgroup J of the additive group such that 
axb € I for alla,b € A and az € I. Relations x = x’ modI and y = y’ mod then 
imply zy = x’y’ mod I, defining a multiplication — a ring structure — on A/T. 
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forms (f,g) defined on a vector subspace a of L'(G) whose elements are 
“sufficiently regular” functions — whatever that means... — satisfying some 
obvious conditions:!?° 


(BT 1) a is invariant under the maps f 4 f, f > €, * f for all x and 
frig*f for all g € a [so that a is a subalgebra of L1(G)], 

(BT 2) a is everywhere dense in L1(G), 

(BT 3) the function pu(e, * f,g) is continuous for all f,g € a and 


(31.7) w(h* f,9) = i (ce * fg) A(a)ae 


for all h € L(G), 
(BT 4) p(f,5) = (9, f) for all f,g € a. 


There is no need to go further to find examples of such objects: take G = R, 
for a take the set W(R) [not contained in L(G)] of integrable functions whose 
Fourier transforms have compact supports as in n° 30, (ii) and set u(f,g) = 
f f(t)g(tdA(t), where \ is a positive arbitrary measure on R. If pu is a central 
distribution of positive type on a Lie group G, choose a to be Schwartz space 
D(G) and set p(f, 9) = w(g * f). 

In the general case, specifying a function p satisfying the stated conditions 
enables us to define a Hilbert space H(j), a canonical map f +> f,, from a 
to H(z), an involution S, two representations L, and R, of G and of the 
opposite group G’, and finally a multiplication in the image A(j) of a in 
H(). Like Dixmier, we will call such a Hermitian form p a bitrace on G. 


(ii) The commutation theorem. “Two-sided representations” of a unimod- 
ular group defined by bitraces have a property well-known (and trivial) in 
the case of the regular representation of a finite group which was proved in 
n° 29, (ii) for the regular representation of a compact group: 


Theorem!”6 46. Let 1 be a bitrace on a locally compact unimodular group G 
and L(s) [resp. R()] the algebra of continuous operators on the space H(j1) 
commuting with right (resp. left) translations. Then 


(31.8) L(m) = Rw)’, R(w) = L(p)’ 
and 
(31.9) SAS"! = A* forall AE L(p)NR(u). 


125 See R. Godement, Séminaire Bourbaki, Mars 1951, and Théorie des caractéres I 
(Annals of Math., 59, 1954, pp. 63-85), where spaces a of bounded measures 
are considered instead. The definition adopted here consists in replacing a by its 
intersection with L'(G) and is suffices for our purpose. 

126 For the regular representation, see I. E. Segal (Annals of Math., 51, 1950, 
pp. 293-298) and for the general case, R. Godement (J. de Math. pures et appli- 
quées, XXX, 1951, pp. 1-110) and notes in CRAS (November 1949). 
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Since A” = A for any von Neumann algebra [n° 19, (vii)], relations (8) 
are equivalent. As R() contains all R(x), it is clear that R(ju)’ D L(j4) and 
L()' D Rp). So it suffices to show that all A € L(y) commute with all 
Be R(wu). 

For f € a, the definition algebra of ww, consider the operators 


Lyu(f) = f Lula)sfle)de, yl) = f Rela) fle)ae. 
Formula (31.7), or (30.4) in the case of a measure, shows that, in notation (5), 


Lulf)gu = (Ff * 9) = fu-9p 


and similarly, 


Rulf)9u = (9* fp = Gu-Su- 


The assumption implies that A commutes with all R,(f) and B with all 
L,(f), and so that 


(31.10) AXGicki) Aga tay Boon) = ta Bon 


for f,g € a. The “obvious” proof of the theorem, used for compact groups, 
would consist in writing that 


BA (fu-Gu) = B(Afn-Gu) = Afp-Bau ; 
AB (fu-Gu) =A(fu-Boy) = Afp-Baou; 


but neither Af, nor Bg, are in general in A(j:), and a priori, their product 
is not well-defined, even if G = R and p =e. 

It is, therefore, necessary to use somewhat less simple arguments. The 
theorem being only a particular case of a much more general theorem with 
exactly the same proof, but more applications, we will work in the latter 
framework. 

We start from the following three data:!?7 


(i) an algebra A with or without unit element whose multiplication will be 
denoted by f.g, 

(ii) asemi-linear map (“involution”) S on A such that S$? = 1, 

(iii) an inner product (f|g) on A such that (f|f) =0 implies f = 0. 


7 The notation used (f,g,y, etc.) aims at highlighting the similarity with the 
regular representation case, where Roman letters denote functions of L(G), Greek 
letters a priori arbitrary functions on L? (G). For what follows, see R. Godement, 
Théorie des caractéres. I. Algébres unitaires (Annals of Math., 59, 1954, pp. 47— 
62) and a talk of March 1951 in séminaire Bourbaki. At the time, exchanging 
ideas with Jacques Dixmier was very helpful for me. See his article in Compositio 
Mathematica of 1952 and the two books already referred to. 
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Set || fll = (f|f)'/? and denote by H the Hilbert space completion of A, 
which, therefore, is H(j.) in the case of the theorem and L?(G) if = ¢. The 
map S can be extended to H. I will in general write Sf = f. 

A will be said to be a Hilbert algebra if it satisfies the following conditions: 

) Sf.Sg=S(g.f) for f,gEeA 

(AH 2) (f.g|h) = (glSf-h) for frg,he A; 
(AH 3) (flg) = (SglSf) for f,g € A; 
( ) for all f € A, there is an upper bound ||f.g|| < M(f)|lg|l; 

(AH 5) the products f.g are everywhere dense in A (hence in #). 
These axioms are satisfied by the algebra A(j1) associated to a bitrace yu. The 


first one is obvious. The second one can be written h*(f*g) =(fxh)*g. (AH 3) 
follows from (BT 4). (AH 4) is clear since f,,.9, = Ly(f)g, with a continuous 
operator L,,(f) on H(j1), as on any unitary representation. Finally, (AH 5) 
follows from the fact that, for any unitary representation (H,U), every « © H. 
is in the closure of the set of vectors U(f)a where f € L'(G). However, a is 
everywhere dense in L'(G). 

We next consider the general “abstract” situation. For all f €¢ A, multi- 
plication in A leads to maps 


(31.11) Lif)g = f.9, R(f)g = 9-f 


from A to H. By (AH 4), the maps L(f) extend to continuous operators on 
H. (AH 2) says that 


(31.12) L(f)* =L(f) for fe A. 
On the other hand, (AH 1), (AH 2) and (AH 3) show that 


SL(f)S-19 = S(f.9) =9.f = R(F)g, 
i.e. that 
(31.13) SL(f)S-" = R(f), 


and so all R(f) also extend to continuous operators on H. Trivially (associa- 
tivity), 


Lf)R(g) = R@)LP), Lfg = Rg)f 


for f,g € A. These operators enable us to consider H as a “bimodule” over 
the ring A: for this it suffices to set 


(31.14) fe=Lif\y, gv f=Rf)e for feA, pEH. 


Associativity follows from L(fg) = L(f)L(g), R(fg) = R(g)R(f). For the 
regular representation on L?(G) we thereby obtain convolution products, 
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and this example shows (if G is not compact) that, without assumptions on 
y and w, the product y.w of two elements of H is not well-defined (or, if it 
is from the point of view of group theory, does not belong to 1). 

To get theorem 46, it, therefore, suffices to prove the next result: 


Theorem 47. Let A be a Hilbert algebra and H its completion. If A and B 
are continuous operators on H such that 


AR(f)=RP)A, BL(f)=L(f)B 


for all f © A, then AB = BA. 


The first step of the proof consists in associating to each y € H a linear 
map R(y): A — H by setting 


(31.15) Riyp)f=Liffp=fye for fea. 
y will be said to be right moderate if R(y) is continuous, which the case of 
f € A by (AB 3), and the extension of R(y) to H will also be denoted by 


R(y). The left moderate y are defined in the same way. Hence in this case, 
there is a continuous operator L(y) satisfying 


(31.15’) L(y)f =R(f)y forall fe A. 


Lemma 1. [fy EH is right moderate, so is By and 


R(By) = BR(y). 


Indeed, for all f € A, 
BR(g)f = BL(f)e = Lf) Be. 
As B and R(y) are continuous, the map f +> L(f)By is continuous, so that 
Bg is right moderate, qed. 
Similarly, if y is left moderate, so is Ay and 


L(Ag) = AL(y). 


Lemma 2. [fy is right moderate, then so is p and 


Indeed by (15), for f,g € A, 


(R(v) fl) = (L(f)vlg) = (VIL(f)9) = (¢lf-9) = G-FIP) 
= (L(g) fl¢) = (FIL(9)) = (fIR(@)9)- 
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This proves the formula and shows that ¢ is right moderate. 
Similarly, 


L(g) = L(y)" 


if vy is left moderate. 


Lemma 3. Any right moderate p € H is left moderate and 


by (15’), so that ¢ is left moderate. Hence so is y = (~) by lemma 2, qed. 
Lemmas 2 and 3 show that the notions of left and right moderation are 
the same. Hence we will talk of moderate elements without any further detail. 


Lemma 4. [fy and w are moderate, then 


(31.16) L(p)y = Rv) ¢. 
For f € A, 
(Ly)¥lf) = WIL(P)F) (lemma 2) 


= (V|R(F)B) = (RPI) = (SL(A)S~'YI Sy) by (13) 
= (AIL(A)Y) = VIRW)S) = RW) elf) 


by lemma 2, qed. 
We are now ready to show that AB = BA. For f,g € A, Af and Bg are 
moderate by lemma 1. So 


AB(f.9) = ABL(f)g = AL(f)Bg = L(Af)Bg (lemma 1) 
=R(Bg)Af (lemma4) =BR(g)Af (lemma 1) 
= BAR(g)f = BA(f.9); 


(AH 4) then shows that AB = BA, qed. 
The equality 


(31.17) SAS" =A 


remains to be proved for any continuous operator A commuting with all 
L(f) and all R(f), i.e. belonging to the common centre of the von Neumann 
algebras £ and R associated to A. For f,g,h Ee A, 
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(SAS(F.9)Ih) = (RAGA) = (HALA = 
= (LAF) = (L(Q)hIA) = (R()gIAA) = 
= (I R(M)AS) = (gl AR()A) = (gLR(AR)f) (emma 1) 
= (glL(f)Ab) (lemma 4) = (L(f)gl Ah) = (f.gl Ah). 


Thus (17) follows since the f.g are everywhere dense. qed. 

Finally, note that the set of operators L(f) where f € H is moderate is 
a two-sided ideal Lz of the von Neumann algebra £ (the notation will be 
explained later). Indeed, lemma 1 shows that it is a left ideal. On the other 
hand, keeping the notation of lemma 1, 


R(y)B = [B* R(Sy)]" = R(B*Sy)" = R(SB*Sy) . 
The result follows by permuting left and right. 


Corollary. The set of L(f) [resp. R(f)], f € A, is ultrastrongly everywhere 
dense in the von Neumann algebra £L (resp. R) of continuous operators com- 


muting with R(f) [resp. L(f)] for any f € A. 


Indeed the theorem shows that £ = FR’, so that [n° 19, (vii), density 
theorem] the algebra of L(f) + A1 is ultrastrongly dense in £. To show that 
that so are the L(f), it suffices [n° 19, (vii), lemma 6] to show that, for y € H, 
for all f € A, relation L(f)y = 0 implies y = 0. As 


(L(f\ela) = (vlF-9) . 


this is axiom (AH 5) of Hilbert algebras, qed. 
Lemma 4 enables us to define in the set A of moderate y € H a multipli- 
cation extending that considered in A, namely 


(31.18) g =L(y)b = RW)p 
y.w can even be defined if only one of the factors is moderate: set 
(31.18’) pw=L(y)p or Rwv)yp 


according to whether y or w is moderate. This multiplication is associative 
in the set of moderate elements of H, which thereby becomes an algebra, and 
transforms H into a bimodule over it. Indeed if y and w are moderate, then 
for allOdEH 


(p.¥).6 = L(p.b)0 = L(L(y)v) 6 
=L(y)L(W)O@ (lemmal) =y.(w.0). 


In particular, L(y.) = L(y)L(w) and R(y.w) = R(w)R(y). On the other 
hand, the involution y+ ~ preserves moderate elements and by lemma 3, 


(v.h) = SL(v)b = SL(y)S"' Sp = R(P)b =H. 
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The algebra A nothing to do with a dual group or a spectrum — of 
moderate elements clearly satisfies conditions (AH 1),..., (AH 5), and so 
is a Hilbert algebra. Applying the previous construction to it does not give 
anything more. If the only moderate elements of H are the maps f €¢ A, A 
will be said to be a maximal (or full in Dixmier) Hilbert algebra. 

For example, let us take A = L(G), where G is commutative, whence 
H = L?(G). By Fourier transform, the moderate y € H are the functions 
for which the map f ++ @f, defined on the Fourier transforms of f € L(G), 
is continuous on L2(G). This obviously means that @ is bounded. So in this 
case, the maximal algebra A is the set of f € L?(G) with bounded Fourier 
transform. If unimodular G' is not commutative, then A is the set of ype 
L?(G) such that fH y* f (or f 6 f * vy: lemma 3) maps L?(G) to L7(G). 

If a Hilbert algebra A is irreducible, i.e. if # does not contain any non- 
trivial right and left invariant closed subspace, the algebra Z = LAR of 
operators commuting with all L(f) and all R(f) is reduced to scalars. As Z 
is the centre of £, algebras £ and R are factors in the sense of von Neumann 
theory [n° 19, (vii)]. 

Exercise 1. Let H’ be a closed subspace of H invariant under all L(f) and 
all R(f) and let E be the projection onto H’. Show that A’ = ANH’ = E(A’) 
is everywhere dense in H’ and is a maximal Hilbert algebra in H’ if A is 
maximal in H. 


(iii) Traces on a Hilbert algebra. Let us first consider a central measure 
du(x) = y(a)dx defined by a continuous function vy such that y(xy) = y(yz). 
This simple particular case rarely occurs except for commutative, compact or 
discrete groups. We denote the algebra of moderate elements of H(w) = H by 
A. There is [n° 30, (iii), proof of the Gelfand-Raikov lemma] a vector u € H. 
such that 


g(a) = (Ly(w)ulu), fu = Ly (fu 


for all « € G and all f € L(G), whence f,, = f,,.u by definition (18’) of the 
multiplication. It follows that R(w) = 1, and so that u is a moderate element 
of H(z). Since then v.u = R(u)v = v, the vector u is a right unit element of 
the algebra A, and since v.u = v implies u.v = 0, u is in fact a unit element 
of A and u = u. The most obvious example is that of the measure € on a 
discrete group G. As L(u) = R(u) = 1, for any operator A € £ and any 
FEA, 


(31.19) Af = A(u.f) = AR(f)u = R(f)Au = Au.f =af. 


The element Au = a € H defines A = L(a) and is moderate since A is 
continuous. The von Neumann algebra C is, therefore, the set of L(a), a € A. 
For all A € £, the formula 


(31.20) Tr(A) = (Aulu) = (alu) 
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defines a linear functional on £ with all the properties of a trace. Indeed, if 
A= L(a) and B = L(b) where a,b € A, then 


Tr (B* A) = (Au|Bu) = (a\b) = (6|@) = Tr(AB*) , 


whence 

(31.21) Tr(AB) = Tr(BA). 

Obviously 

(31.22) Tr(A*) = Tr(A), Tr(H)>0 if H*=H>0. 


If moreover 0 = Tr(H) = (Hulu) = ||H'/2ull?, then H!/? = L(H*/2u) = 0 
and so H = 0; trace (20) is then said to be faithful. 

If (H;,) is an increasing sequence of positive Hermitian operators on £ 
and is bounded above [n° 19, (v), lemma 8], then the definition of sup H,, 
shows that 


(31.23) Tr (sup H,,) = sup Tr(H) . 


Finally, it is clear that the initial function p(x) = (L,,(x)u|u) becomes 


(31.24) ele) = Tr{L,,(2) 
and that 
(31.25) (Fuld) -_ Tr[L(g)* Lyu(f)] 


for all f,g € L(G). Besides this is only a different way of stating relation 
(31.25) Tr [L(b)* L(a)] = (ald) 


obtained above as an aside. All this generalizes in an obvious way to uni- 
tary algebras with unit element and is directly connected to the theory of 
von Neumann algebras of finite type due to Dixmier and to their wonderful 
properties. !28 

In the general case of a maximal algebra A without unit element, the two- 
sided ideal £2 of £ defined above is the set of L(f), f € A. For A,B € La, 
in accordance with (25’), we set 


Tr(B"A) = (flg) if A=L(f), B=L(y), 


128 For example, Dixmier has shown that, if Z is the centre of such an algebra A, 
then there is a map T +> T* from A onto Z with the formal properties of the 
similar map with respect to a compact group, that T® is the only element of 
Z belonging to the closed convex envelope of the set of UTU~*, where U is 
unitary in A, and that the traces on A are the functions T +> y(T") where xy is 
a character of Z. 
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where f,g € A. The result only depends on the operator B* A = L(g.f) since, 
for u,v € A, 


(BY Aulv) = (L(G.f) ule) = (L(fJulL(g)e) = (Ru) fIR(@)49) - 


Choosing u, and v, such that R(u,)f and R(vp,)g converge to f and g in 
H (density theorem), (f|g) = lim(B* Auy|v,), whence the result. Tr(B* A) is 
clearly a positive Hermitian form on £2. Equipping £2 with this inner product 
and with the obvious involution gives a Hilbert algebra isomorphic to A, for 
good reason... As above, the relation Tr(AB) = Tr(BA) is equivalent to the 


formula (f|g) = (lf). 
One can also define the trace of an operator H in the set £4 of positive 


Hermitian operators on L. Set 

(31.26) Tr(H) = Tr (a?H?) if H¥/? € Ly, 
(31.26’) Tr(H)=+00 if HV? ¢ Ly. 

Tr(H) > 0 and Tr(H) = 0 if and only if H =0. 

Theorem 48. The function Tr(H) has the following properties: 


(a) Tr(UHU*) = Tr(H) for any unitary operator U € L; 

b) O0< H’ <@ implies Tr(H’) < Tr(A). 

c) Tr(H'+H") = Tr(H') + Tr(A”). 

d) If H, < Hy <... is an increasing sequence!?? bounded above in La; 
then 


(31.27) Tr (sup H,,) = sup Tr(H) . 


(a) The operator UH'/?U* is positive Hermitian and U*U = 1 since U is 
isometric. So (UH'/2U*)? = UH'/?U*UH'/2U* = UHU* and 
UH'/?U* = (UHU*)'” . 


This being so, we first suppose that Tr(H) < +00, whence H'/? = L(a) for 
some moderate a € H. Using relations UL(f) = L(Uf) and L(f)* = L(f) = 
L(Sf) of section (ii), it follows that 


UH/?2U* = UL(a)U* = L(Ua)U* = [UL(Ua)*]* 
= [UL(SUa)]* = L(USUa)|* = L(SUSUa). 


As SUSUa is moderate, UHU™* has finite trace and 
Tr(UHU—*) = (SUSUa|SUSUa) = (ala) = Tr(H) 


9 The argument equally applies to an increasing filtering family. See Dixmier’s 
book on von Neumann algebras. 
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since U and S are isometric. 

On the other hand, H = U*(UHU*)U. Conversely, if U and U* are iso- 
metric, in other words if U is unitary, then Tr(H) < +00 since Tr(U HU") < 
+oo, qed. 

(b) One may assume that Tr(H) < +00, and so H'!/? = L(u) for some 
ué A. As ||H"/? f\/? = (Aff) < (Af|f) = |lu-f|l?, there is a continuous 
operator A’ on H such that H’/?f = A’H'/?f for all f € A. Since this 
relation only determines A’ on the image of H'/?, image whose orthogonal 
is N = Ker(H'/2), A’ can be required to be trivial on NV. The operator 
A’ in then fully determined and satisfies H"/? = A'H'/? = H1/?,A”™ since 
H' = H". This relation shows that the image of H’/? is contained in the 
image of H'/?. Denoting its closure by M, i.e. the subspace orthogonal to 
N, one sees that A’ maps M to M and N to N. 

Let us show that A’ € L, i.e. that it commutes with all R(g), g € A. 


R(g)A'H'? f = Rig) Hf = H"/? Rg) f = AHP R(g)f = ARG) H 


holds for all f € A, so that R(g)A’ and A’R(g) coincide on M. As R(g) 
commutes with H, it preserves V; R(g)A’ and A’R(g) are, therefore, trivial 
on NV. Thus R(g)A’ = A’ R(g) as desired. 

This said, by lemma 1 about Hilbert algebras, H’!/? = A’H'/? = 
A'L(u) = L(A’u). As a result, 


Tr (H’) = ||A’ull? < |lull? = TA) 


since the norm of A is < 1, qed. 

(c) As H’ + H” = H is an upper bound of H’ and A”, the trace of H’ 
and H” may be assumed to be finite. So H!!/? = L(u’) and H'/? = L(u") 
for some u’, u” € A. As was mentioned above, there are operators A’, A” € L 
that vanish on the subspace N of vectors orthogonal to Im(H'/?), preserving 
the orthogonal M of NV, and such that 


(31.28) A/? = A’? = Ae am 2 — Al? — Fpl? ai 
By lemma 1, 
H! = HY? A 7? — A? AMT (uw!) = HY? (v') where vu! = Au’. 
Likewise, H” = H'/?L(v") , where uv” = A”*u". So 
H = H'/?L(v) where v = Au’ + Au". 


All vectors u’,...,v’’ are moderate, and so are in A. 
The previous relation shows that the operator 


(31.29) i H/2 = L(v) an Hi/2 _ Al H"/2 = All A/2 


is a zero of H!/?. Since M and N are invariant under H!/2, A’ and A”, 
hence also under their adjoint maps, so are they under T as well. Now, H'!/? 
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is injective on the orthogonal subspace M to Ker(H!/?). So H'/? = L(v) on 
M. The images of H’/? and H’/? being contained in M as was seen in the 
previous lemma, these operators act trivially on NV’, whence T = 0. 

Thus H'/? = L(v), which shows that H has finite trace, and by definition, 


Tr(H) = |lo|/?, Tr’) = lu’, Tr”) = |u|” . 

However, the relation H = H'/?(A’* A! + A’* A”) H'/? shows that 
2 2 
(31.20) Wr? = 14's +14" FI 
for all f of the form H'/2q, and so for all f € M. On the other hand, 
L(u’) = AH? = A'L(v) = L(A'v) 

(lemma 1), whence u’ = A’v by axiom (AH 5) of Hilbert algebras. Likewise, 
u’ = Av. Relation (30), therefore, proves that Tr(H) = Tr(H’) + Tr(H”) if 
M is shown to contain v. But M is the closure of the set of vectors H'/? f = 
L(v)w = R(w)v with w € A since L(v) is continuous, and we know that the 
unit operator is a strong limit of operators R(w), qed. 

(d) The first expression being greater than the second one, it suffices 
to prove the inverse inequality when the second expression is finite. Thus 
H,/? = L(an) with ay, € A such that |lan||? = Tr(H,). Set H = sup Hy. We 
know that (H;,) [n° 19, (v), lemma 8] converges strongly to H and [n° 22, 
(i), exercise 2] that H,/* converges strongly to H/?. So, for x,y € A, 


(H*/22|y) = lim (H}/?x|y) = lim (an.a|y) = lim (ay|y-£) . 


As sup ||an|| < +00 and as the y.# are everywhere dense in H, (a,) converges 
weakly to some a € H such that 


(H'/?2|y) = (aly.&) = (a.2ly) 


for all x,y € A. As H'/? is continuous, a is moderate and H'/? = L(a), 
whence Tr(H) = |la||? < sup ||a,||? = sup Tr(H,,), qed. 
Exercise 2. For all H € £4, there exists H’ € £4 such that 


H'<H, 0< Tr(H’) < +o. 


[Set H’ = H'/?L(f.f)H'/2 for some conveniently chosen f € Al. 

Exercise 3. Tr(A*A) = Tr(AA*) for all A € £. 

In general there are many other functions on £4 having properties 
(a),..., (d) of the statement: setting Z = LAR and 2, = ZNL4, any 
A€ Z, defines such a function, namely 


(31.31) Tra(H) = Tr(AH). 


This is well-defined since, A and H being positive Hermitian and commuting, 
AH € Ly [n° 22, (i), exercise 1]. If H'/? = L(a) for some moderate a € H, 
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then (AH)'/? = Al/?2H1/? — [(A'/2a), and since A‘/?a is moderate, it 
follows that 


Tra(H) = (A¥/?a|A‘/2a) < || All Tr(H). 


As shown by Dixmier, this gives all traces having properties (a),..., (d) of the 
theorem and that of exercise 3. Dixmier has also shown that any “reasonable” 
trace on a von Neumann algebra can be obtained in this manner from a 
unitary algebra; see this books. 


(iv) Case of a commutative group. In the case of a commutative Icg, one 
can add to theorem 44 thanks to the Fourier transform. We will confine 
ourselves to the case of the measure yz = €. The reader can easily address the 
general case by the same method. 


Theorem 49. Let G be a locally compact commutative group and T a con- 
tinuous operator on L?(G) commuting with the translations. Then there is a 
function T € L®(G) such that T is the operator f —> Tf. 


The set R(G) of these operators is a von Neumann algebra to which 
theorem 46 can be applied for ~ = €: since G is commutative, R(G) = 
R(G)'. Let M(G) be the image of R(G) under the Fourier transform. It 
it the commutator of the algebra A of operators M(p) : f +> pf, where 
p is a finite linear combination of characters of G. Hence M(G) = A’, and 
so M(G) = M(GY = A” using von N’s notation. Since A is a self-adjoint 
algebra containing 1, A is everywhere dense in M(G) with respect to the 
ultrastrong topology [density theorem, Chap. XI, n° 19, (vii)]. 

This means that, if (e;) is an orthonormal basis!2° of L?(G), then, for all 
r > 0, there is a linear combination p of characters of G such that 


2 
y \|Tex — peilly <r’. 
So there is as sequence of functions p,, for which 


Tei(x) = liam.” pp(x)ei(x) for alli. 


By the Riesz-Fischer theorem, for each 7, there is a subsequence for which 
the previous relation holds in the sense of convergence almost everywhere. 
As the set of indices i is countable, the same subsequence can be assumed to 
be suitable for all 7, in other words that 


Te;(x) = lim pp(x)ei(x) ae. 


for all 7. 


189 To simplify the proof, it will be assumed to be countable, which amounts to 
supposing that G is separable. 
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However, G is the union of sets {ei(x) # 0}, up to a null set, since 
otherwise there would be functions 4 0 on L?(G) orthogonal to all e;. As a 
result, the limit 


T(x) = limp, (x) 


exists almost everywhere. As Te;(y) = T(y)e;(x) ae. for all i, Tf(x) = 
T(x)f(x) ae. for any finite linear combination of e;. But any f € L?(@) 
is a limit in mean, hence also almost everywhere, of such combinations. So 
T f(x) = T(x) f(x) ae. for all f € L2(G). It remains to observe that, the 
operator T being continuous, ||T'||so = ||T'|| necessarily holds. 

The reader will easily show that if Tis positive Hermitian, then the func- 
tion T is positive and that Tr(T) = [ T(x)dy. 


(v) Characters of a locally compact group. Let p be a bitrace on a uni- 
modular leg G, (H(1), Ly, R,,) the two-sided representation of G and A(y) 
the maximal Hilbert algebra defined by yw. For all f € L(G), the image f, of 
f in H(w) is in A(z). For f,g € L(G), by definition, 


Hf, 9g) = (fulgn) = Tr[Ly(g)*Ly(f)] 


where Tr is the canonical trace (theorem 48) on the von Neumann algebra 
L(u) generated by the L,,(f) or L,,(x). w will be said to be a character of G if 
the algebra Z(j) of operators commuting with all L,,(f) and R,,(f) is reduced 
to scalars, in other words if the algebras C(j:) and R(j) are factors in the 
sense of von Neumann. The representation (1, y) +> L(x) R(y) of G x G on 
H() is, therefore, irreducible and conversely. 

As shown by von N., there are four large classes of factors. He distin- 
guished between them by investigating the closed subspaces € of H such that 
the corresponding projection Pg is in the algebra A considered, and by con- 
sidering equivalent two subspaces €’ and €” when there is an operator on A 
mapping €’ isometrically onto €”’. The equivalence classes of these subspaces 
are characterized by a number dim(E) € [0, +00] having the formal properties 
of a dimension. Then, up to a constant factor, the possibilities are as follows: 


Case (I,), n € NU {+co}: the dimension takes values 0,1,...,n; 
Case (Ij): it takes all values in the interval [0, 1]; 

Case (II..): it takes all values in the interval [0, +00] ; 

Case (III): it only takes values 0 and +oo. 


This classification applies to characters of a unimodular leg. Case (III) does 
not occur, but it is encountered in the regular representation of some non- 
unimodular groups. 

In the case of the algebra A = £(H), the von N dimension is the usual 
one and is just Tr(Pg), where Pg is the orthogonal projection onto € and 
where Tr(H) = >°(He;|e;) is the usual trace calculated in an orthonormal 
basis. Nonetheless, the notion of the trace of an operator hardly appears 
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in von N.’s articles. Partly influenced by representations defined by central 
measures of positive type (the case of compact groups was already indicative), 
Dixmier reinterpreted von N.’s theory in terms of traces and freed it from its 
restriction to factors. In the case of a factor, Dixmier showed that, up to a 
constant factor, there is a unique function Tr defined on Hermitian elements 
> 0 of A, satisfying theorem 48 and such that dim(€) = Tr(Pz). 

Like elementary functions of positive type [n° 30, (iii), lemma 2], char- 
acters can be defined by an “extremal” property. To see this, it suffices to 
observe that, for every bitrace yz, bitraces u’ < p correspond bijectively to 
operators H € Z() such that 0 < H <1, the bitrace defined by such an H 
being 


La(f,9) = Tr[Ly(g)" HL, (f)] - 


Conversely, every bitrace yu’ < js defines an operator H by 


(A ful 9) = uf, g)- 


The proof is the same as that of lemma 2 of n° 30. The conclusion that fol- 
lows is that a bitrace yz is a character if and only if every bitrace pi’ < p is 
proportional to 4, which is equivalent to saying that characters of G corre- 
spond to extremal generators of the convex cone of bitraces. In this general 
situation, there is unfortunately no analogue of the Krein-Milman existence 
theorem, let alone of Choquet’s theorem, on compact convex sets. 

Nonetheless, the theory of direct integrals of Hilbert spaces of n° 24 applies 
to the algebra Z(4) or to any Gelfand-Naimark subalgebra Z C Z(,1) such 
that Z = Z(u). Indeed, using separability assumptions, there is then a 
decomposition 


H(u) = i H(A), 


where integration is over the spectrum of Z. There then exist operators T(¢) 
in H(¢), defined up to a null set, corresponding to every bounded operator 
T commuting with Z, ie. with Z(). In particular, for almost every ¢, the 
decomposition of operators L,,(x), L,(f), Ru(x), Ru(f) and S, leads to sim- 
ilar operators L(x;¢), etc. in H(¢). Likewise, in almost all H(¢), there are 
algebras £(¢) and R(¢) generated respectively by the L(f;¢) and R(f;¢) 
corresponding to the algebras L(y) and R(j1) [n° 24, (ii), end]. The algebras 
L£(¢) and R(¢) are factors for almost all ¢ and the canonical trace Tron L(t) 
can be computed by the relation 


(31.32) 1) if TrT(C)T (GC) aX(C) 


where, for each ¢, a convenient function Tr is chosen on the factor £(¢). For 
all T = L,,(f) where f is in the domain of definition of the initial bitrace pi, 
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this gives or almost all ¢ a bitrace f +> Tr[L(f;¢)L(g;¢)*] f that is the 
character of G. The subject is dealt with in detail by Dixmier in chap. 8, 
17, 18 of his C*-algebras. See also Lajos Pukanszky, Characters of connected 
Lie groups (AMS, 1999) and Chap. 14, Abstract Representation Theory, of 
Nolan R. Wallach, Real Reductive Groups II (Academic Press, 1992). 

Case (II,) corresponds to irreducible Hilbert algebras with unit element 
or, in group theory, to central continuous functions of positive type studied 
at the beginning of section (iii) above. Consider for example the regular 
representation of a discrete group all of whose conjugacy classes are infinite. 

In case (II), thanks to his theory of induced representations, G. W. 
Mackey obtained examples for groups seemingly quite simple, the semidirect 
products!*! of commutative groups. 


(vi) Characters of type (I). Groups that have been by far the most stud- 
ied are those whose characters are of type (I): all linear algebraic groups 
fall in this case; so do real semisimple Lie groups with infinite centre, a fun- 
damental result proved by Harish-Chandra, and all solvable real algebraic 
groups, i.e. whose matrices are, if necessary by passing to C, simultaneously 
triangularizable (Dixmier, Kirillov, etc.); the others are, as a first approxima- 
tion, semidirect products of a solvable group and a semisimple group, which 
enabled Dixmier to obtain the general result. Actually, for these groups, char- 
acters can be shown to be distributions and, in the semisimple case, locally 
integrable functions, a very hard result of Harish-Chandra’s. 

Let A be a unitary algebra and H its completion. Assuming A to be 
irreducible, the von N. algebras £ and F are factors. Let us suppose they 
are of type (I) — as they are isomorphic by T -—> STS~, it suffices that 
one of them be so — and let Tr be the trace on R, normalized so as to take 
values 0,1,2,... We choose a projection P € R such that Tr(P) = 1 and let 
E = Im(P). Since P € R, the subspace € is invariant under T € L. These 
act irreducibly on €. Indeed, if a closed subspace €’ C € is £-invariant, the 
corresponding projection P’ is in R and satisfies 0 < P’ < P. So Tr(P’) < 
Tr(P), whence Tr(P’) = 0 or 1. In the first case, €’ is trivial; in the second, 
Tr(P — P’) =0. Thus P’ = P and €’ = €, which gives the result. It follows 
that the restrictions of operators T € L to € are everywhere dense in the 
algebra £(E) of all continuous operators on €. But as L is closed with respect 


131 G is said to be a semidirect product of two groups H and N if H and N are 
closed subgroups of G, if N is invariant in G, and if the map (h,n) +> hn is 
bijective. An easy example: the Euclidean group in R” is the semidirect product 
of the group of rotations about the origin and the group of translations. An 
example difficult to analyze: the group of transformations 


(z, 2’) + (e(at)z + u,e(a’t)z’ +u') , 


of C?, where t € R, u,u’ € C and where a, a’ are non-trivial real constants with 
irrational ratio; G is the semidirect product of R and the additive group C?. In 
fact, all possible “pathological” circumstances seem to be encountered in groups 
of this type. 


250 XI — Integration and Fourier Transform 


to all von N. topologies and preserves €, the set of restrictions of operators 
T € CL to E is identical to £(E). On the other hand, if T € CL is trivial on €, it 
is trivial on the closed subspace generated by A(€), where A € R. Now, this 
closed subspace is bi-invariant, and so is identical to H since A is irreducible. 
As a consequence, the map associating to each T € CL its restriction to € is 
an isomorphism from £ onto £(E). Since, there is a unique trace function on 
£, up to a constant factor, it is necessarily the usual trace function of L(E). 
In conclusion, for some T € L£, the condition Tr(T*T) < +00 means that the 
restriction of T to € is a de Hilbert-Schmidt operator. 

In the case of Hilbert algebras associated to groups, one readily deduces 
the next result: 


Theorem 50. Let ys be a character of type (I) of a unimodular group G, 
defined on a subalgebra a of L1(G). There is an irreducible unitary represen- 
tation (E,U) of G having the following properties: 

(a) U(f) = f U(x) f(x)dx is a Hilbert-Schmidt operator for all f € a, 

(b) for all f,g ea 


(31.33) uf, g) = TrlU(f)U(g)"] . 


This property shows that, from the point of view of harmonic analysis, in 
some sense groups of type (I) generalize compact groups. It would be wrong 
to infer conjectures optimistic about difficulties involved in their study (clas- 
sification of irreducible representations, explicit computation of characters 
and of Plancherel’s formula, etc.): fifty-five years of efforts (1947-2002) have 
not been enough to solve all these problems, in particular the first one, in the 
case of semisimple groups. 

The fact that real semisimple groups (Harish-Chandra, 1950-52) or over a 
local field (J. Bernstein, 1974) have property (a) of the theorem follows from 
a far more important theorem. For the sake of simplicity, let us suppose that 
the centre of G is finite. Then there are pairwise conjugate maximal compact 
subgroups in G,!°? and if K is one of them, one can show that, for any 
irreducible unitary representation (H,U) of G and any type 0 of irreducible 
representations of K, the multiplicity of 0 in the given representation of G is 
< dim(0). If y is a character of K, normalized by the condition that yx. = y, 
and if 


U(x) = / U(k)x(B)dk 


[n° 29, (iii), end, where this operator is written E(y)], then the subspace 
H(x) = Im(U(X)) has dimension dim(y)?, where dim(y) denotes the dimen- 
sion of the irreducible representation having character y. However, for any 
f € L1(G) such that 


182 Typical examples: the group SO(n) in SL,(R), the group SU(n) in SL,,(C), etc. 
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(31.34) = f(z) =x* fey’ (a )= ff seek X(k) x! (k )dkdk’ , 


U(f) =U(X)U(f)U(\’). The rank of the operator U(f) is, therefore, finite. 
But using the theory of compact groups of n° 29 it is easy to see that finite 
sums of functions of the previous type are everywhere dense in L(G). Hence 
it follows that the set a of f € L(G) for which U(f) is a Hilbert-Schmidt 
operator satisfies conditions of section (i) of this n°. This enables us to asso- 
ciate a character pu(f,g) = Tr[{U(f)U(g)*], obviously of type (I), to the given 
representation. 

Conversely, any character 4 of G is obtained in this manner. Indeed, let 
(H, L, R,S) be the two-sided representation of G' associated to yw. The map 
(x, x’) ++ L(x)R(2’) is an irreducible representation of the semisimple group 
G x G in which K x K is a maximal compact subgroup. The characters of 
K x K are obviously the functions (x, x’) -+ y(x)x/(a’) where x and y’ are 
characters of K, and for the corresponding projections, 


BG x) = ff LORE x(x W )dkal! = LOQR(). 


We then choose some x’ such that R(y’) # 0 and set € = Im(R(y’)). This 
closed subspace is invariant under the von Neumann algebra £. Denoting the 
restriction of L(x) to € by U(x), we get a unitary representation (€,U) of 
G. Obviously, E(y) = Im[L(x)R(\’)] holds for the latter. But as G x G is 
semisimple, the general result stated above implies that this subspace is finite- 
dimensional. We then deduce that that there are several operators T € £L 
whose restrictions to € are Hilbert-Schmidt operators, and as CL is a factor, 
this is enough to show that £, and so yp, is of type (1). 

Once again, I refer to Dixmier’s books for additional information or more 
general proofs. 


32 — Discrete Components of the Regular Representation 


Given a unimodular group G, the basic problem is to decompose the regular 
representation of G into a direct integral of irreducible representations. Such 
a decomposition can include a “discrete spectrum” and a “continuous spec- 
trum”. As in the theory of compact groups where the second one does not 
exist, the first one is obtained by using irreducible representations that can 
be embedded in L?(G), in other words the minimal left — or right (at choice) 
— invariant closed subspaces of L?(G). Such a representation is, by definition 
a discrete component of the regular representation. They are sufficient for the 
decomposition of the regular representation if G is compact, but do not exist 
if G =R. Hence it may be thought that they also do not exist in the general 
case. Using explicit calculations, one of Einstein’s assistants in Princeton, 
Valentine Bargmann, found!®? that the group SL2(IR) does have some, as 


183 Trreducible unitary representations of the Lorentz group (Ann. of Math., 48, 1947, 
pp. 568-640). The Lorentz group itself is isomorphic to the quotient of SL?(C) by 
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will be seen in Chap. XII, n° 16 and 30. In this case, the decomposition 
of the regular representation, i.e. Plancherel’s formula for G, contains both 
“Fourier” series and integrals. More precisely, Bargmann found that there 
are irreducible representations (H,U) some of whose coefficients are square 
integrable, and checked that they satisfy orthogonality relations similar to 
those of compact group theory. As shown by Harish-Chandra, these repre- 
sentations are also encountered in the case of semisimple groups having a 
maximal compact torus, for example the symplectic group Sp,,(R). They are 
realized on square integrable holomorphic functions spaces and are essential 
in his general Plancherel formula. They are also encountered in the case of 
semisimple p-adic groups and in that of associated adelic groups, which are 
legs but not Lie groups, and they exist for some nilpotent Lie groups.!°4 

In fact, Bargmann already knew, and Harish-Chandra insisted on this 
point in the general case, that it is essential to consider not only SLZ2(R), but 
its universal cover G, a group that cannot be embedded in a linear group. Like 
any Lie group, SZ2(R) is isomorphic to the quotient of its universal cover by 
a discrete subgroup Z of the latter’s centre. In this case, Z is isomorphic to 
Z, and so is infinite. 

More generally, let G be a lcg, Z its centre and (H,U) a unitary rep- 
resentation of G. If (H,U) is irreducible, there is a character a of Z such 
that 


(32.1) U(z) =a(z)1 forall zEeG. 
The coefficients 

(32.2) gas (0) = (U(a)alb) 

of (H,U), therefore, satisfy the functional equation 


its centre {1, —1}. Around the same period, using integral methods, Gelfand and 
Naimark obtained Plancherel’s formula for SL2(C) then for SL, (C); see my two 
talks in the Séminaire Bourbaki of 1948-49. Harish-Chandra quickly generalized 
their Plancherel formula to all complex semisimple groups. Bargmann was only 
using infinitesimal methods, which, at the time, assumed that some coefficients 
of the representations were analytic functions on the group; but like a good 
physicist, Bargmann did not let this“ detail” stop him. The existence of several 
“analytic” vectors, i.e. for which the function « +> U(zx)a is analytic, was 
shown shortly after (see Chap. XII, n° 26). Bargmann’s and GN’s articles are 
still worth reading. The explicit Plancherel formula for SL2(R), more difficult to 
obtain than for SL2(C), is also due to Harish-Chandra (1950) ; see Serge Lang, 
SL2(R) (Springer, 1985). 
C. C. Moore and J. A. Wolf, Square integrable representations of nilpotent Lie 
groups (Trans. AMS, 185 (1973), pp. 445-462). A Lie group G is nilpotent if 
there are closed normal subgroups Z1 C Z2 C ... C Zn = {e} in G such that, 
for all p, the image of Zp in G/Zp_1 is in the centre of G/Zp_1, i.e. if 


134 


eG & yeZ,—> aye tye Zp. 


Example: any closed subgroup of the of triangular matrices with diagonal 1 
(unipotent matrices). 
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(32.3) plwz) = y(a)a(z). 


If Z is infinite, they cannot be square integrable over G; but as |y(xz)| = 
\yp(x)|, they can be square integrable over the quotient group G/Z. The rep- 
resentation will be said to be square integrable if all its coefficients are square 
integrable mod Z. The set of classes of measurable solutions y of (3) such 
that 


(32.4) i |o(x)|? dz < +00 
G/Z 
is a Hilbert space L?(G/Z; a) with inner product 
(32.5) (tl) =f FleaCo)ae. 
G/Z 


Since Z is in the centre of G, left and right translationsf -— e, * f and 
fo f *e, clearly act on this space, and so does the involution f +— f. 
This gives a “two-sided representation” which will be seen to be identical to 
the two-sided representation defined by a central measure pu of positive type. 
One could call it the regular representation defined by a. All this has already 
been seen in the context of “central groups” [n° 29, (iii)]. 

For any f € L(G), the function 


(32.6) fale) = ‘i faz)alade 


obviously satisfies (3) and thereby gives all the continuous solutions of (3) 
having compact support mod Z. The proof, left to the reader, consists in imi- 
tating that of n° 15, (iv) related to the unit character. As the reader will also 
easily check by imitating standard integration theory, the space L(G/Z; a) of 
these continuous solutions with compact support mod Z is everywhere dense 
in L?(G/Z;a). All limit theorems valid for the usual spaces L?(G/Z), and 
especially Riesz-Fischer, can be applied to L?(G/Z; a). Finally, as in the case 
of central groups, the regular representation of G on L?(G) is clearly the 
continuous sum of its representations on L?(G/Z; a): this is relation (29.35). 
For f,g € L(G), 


(falda) = [ af | [ fealoleayeea(e2)dn des = 


= ‘| a(z)dz [ 2: dx i f (xz02)g(x22)dz2. . 


But integrating over Z then over G/Z amounts to integrating over G. As a 
result, 


(Folga) = f alz)dz iE flez)glalde = fF F(2)ale)de. 
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Hence associating the measure 44 on G defined by 


(32.7) n(f) = vi f(ejaleaz 


to the character a of Z, we at last get 


(32.8) (folga) = wg * f)- 


This shows that the obviously central measure yu is of positive type and that 
L?(G/K; a) is isomorphic to the space H() of section (i), the isomorphism 
transforming f, into f,, for all f ¢ L(G) being compatible with representa- 
tions of G on these two spaces and transforming the algebra L(G/Z; a) into 
the Hilbert algebra A(yu). Multiplication in the latter is defined by (31.7): 
fu-Gn = (f * g)y for f,g € L(G). A small calculation readily shows that, for 
fg € L(G/Z; a), this product is just the “convolution mod Z” 


(32.9) fugale) = faz ole =f, Howoly "ay 


Theorem 51 (Bargmann orthogonality relations). Let (H,U) be an 
irreducible unitary representation of a locally compact unimodular group G 
and Z the centre of G. Set U(z) = a(z)1 for all z € Z. 

(i) If the representation (H,U) has a non-trivial square integrable coeffi- 
cient over G/Z, it is square integrable. 

(ii) For such a representation, there exists a scalar X > 0 such that 


(32.10) Ve (U(g)alb) (U(g)a’|b’)dg = d(ala’) (b|b’) 


for all a,a’,b,b) EH. 

(iii) For all f € L(G), U(f) is a Hilbert-Schmidt operator everywhere 
dense in the Hilbert space Lo(H). 

(iv) The coefficients of two inequivalent square integrable irreducible uni- 
tary representations corresponding to the same character a of Z are orthog- 
onal. 

(v) Every square integrable representation such that U(z) = a(z)1 is a 
discrete component of the regular representation of G on L?(G/Z;a) and 
conversely. 


We start by observing that for any representation (H,U) of G, coeffi- 
cients (2) satisfy 


(32.11) Pa,b (y~'gz) aaa Lu (x)a,U(y)b(9) 


and 


(32.12) a,b(9) = ¥b,a (9~*) - 
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So, if G is unimodular, then 


(32.13) (Ya,b) = Pb,a+ 


Proof of (i) and (ii). Let us suppose that Ya’a € L?(G/Z; a) for a pair of 
vectors a,a’ € H. The set D of u € H such that y,,, has the same property 
is invariant by (11), and so is everywhere dense in H. We equip D with the 
inner product 


(32.14) (ulv)’ = (ulv) 1 (Pu,alPv,a) ) 


the right hand side being calculated in L?(G/Z; a), and let H’ be the com- 
pletion of D. For all « € G, U(a), is clearly unitary with respect to this inner 
product. If (u,) is a Cauchy sequence in D with respect to inner product (14), 
clearly, lim un, = u exists in H and the coefficients y corresponding to ordered 
pairs (u,,@) converge in mean in L?(G/Z; a). Now, they converge everywhere 
to Yu,a in G since (uy) converges to u in H. As a result (Riesz-Fischer), 


Lane? Puna = Pua in L?(G/Z;a). 


So u € D and as (un) converges to u in H’, D=H’. 

Since (ulu) < (ulu)’, the canonical injection J from H’ to H is continuous, 
hence has an adjoint J* :H— H'’ =D. JJ* :H => H clearly commutes 
with U(x), hence is a scalar, so that J* is proportional to an isometric op- 
erator. Thus its image is a closed invariant subspace contained in D, whence 
D =H. It first follows that Yuq € L?(G/Z; a) for all u € H, hence, applying 
arguments to u, also that yy, € L?(G/Z; qa) for all u and v, and then that, 
for given a, J* is unitary up to a constant factor. Hence so is J, which gives 
relation of the form 


(32.15) (Yu,alPu',a) = A(a) (ulu’) : 


For u = uw’, the left hand side does not change if a and u are permuted. Thus 
A(a)(ulu) = A(u)(ala), and so A(a) = A(ala) where A is a constant. Then (15) 
becomes 


(32.15’) (Yu,alPu’ a) =A (ala) (ulu’) c 


Therefore, (10) follows by 4(a|b) = (a + bla +b) —... 
Proof of (iii). Let (e;) be an orthonormal basis for H. Setting 


(32.16) yij(x) = (U(az)eile;) , 


for all f € L(G), 


(U(Peile;) = , F@)fu(«) = i if [ F@® fis(a)al2)dz. 


256 XI — Integration and Fourier Transform 


Thus 


(32.17) (U(feiles) = (Yisl fa) - 


By function (16), this is an inner product on L?(G/Z; a). However, (10) shows 
that the functions y,;; are pairwise orthogonal and have the same norm on 
L?(G/Z;a). Therefore, they are part of an orthonormal basis, up to a factor, 
and so 3> |(yij|%)|? < ||l|? for all p € L?(G; a); in particular 


(32.18) dol edles) < [I fall” < +00 


by (17). This shows that for all f, U(f) is an operators of HS. To prove 
that they are everywhere dense in the space of operators HS, it suffices to 
show that, for all S € £L2(H) and all r > 0, there exists f € L(G) such that 
||U(f) — Slo <r, where 


[Alla = (So [l4eil?)? = Tra")! 
is the norm on £2(H). This inequality written 


So llU (fer — Seill? <r? 


will follow from von Neumann’s ultrastrong density theorem [n° 19, (vii)]. 
To prove it, we first recall that as the commutator algebra of U(f) reduces 
to scalars, £(H) is the bi-ccommutator of the self-adjoint algebra of U(f). 
So the operators U(f) are dense in £(H) with respect to the ultrastrong 
topology [n° 19, (vii), lemma 9]. For all g € L(G), let us set a; = U(g)ej. 
Since )> ||a;||2 < +00, the map T > (> ||Ta;||2)2 is one of the seminorms 
defining the ultrastrong topology of £(#). So there exists f € L(G) such that 


(AU ger — SU(g)eilla = INU (fai — Sail|?” < r?. 


As U(f)U(g) = U(f*g), the operators U(f) enable us to approximate SU(g) 
in £2(H). However, for all S of type HS, the map T+—> ST from the space 
L(H), equipped with the ultrastrong topology, to the Hilbert space £L2(H) is 
continuous. On the other hand, we know that the operator 1 is the ultrastrong 
limit of operators U(g). So S can be approximated with operators SU(q), 
hence by operators U(f). Thus the U(f) are dense in £2(H). 

Proof of (iv). See theorem 41 on compact groups. 

Proof of (v). Let us suppose that (H,U) is irreducible and square inte- 
grable, choose some non-trivial a € H and associate the function Yu,q(“) = 
(U(x)u|a) to each u € H. By orthogonality relations, 


(32.19) (Gu,alPv.a) = (ala) (ule) . 


For given a, the set of functions y,,,. is, therefore, a closed subspace (a) of 
L?(G/Z;a). Replace u by U(x)u by replacing gu.a(g) by Yu.a(gz). Then the 
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map U'—> Yq transforms representation U of G on H into the right regular 
representation on L?(G/Z; a), so that (H,U) is a discrete component of the 
latter. 

Note that the orthogonality relations lead to a formula for convolution 
products mod Z of two coefficients: 


Meee / (U(eg)ula) (U (g-*) vla) dg = 


= | (U(g)ulU (#7) a) O@)alodg = 
= Xula) (v|U(x)~*a) , 


whence 


(32.20) Puja *Z Pu,a = A(ula) Pv,a ; 


The convolution of two functions of H(a) is again in H(a). 

Conversely, let E€ be a closed subspace of L?(G/Z;q) invariant under 
right translations R(x) and minimal. The proof reduces to showing that the 
functions 


(92.21) (wiRte)e) =f PO Maty = Fez ule) 


are square integrable mod Z for all u,v € €, and that for this to be the case 
so is one of the coefficients. 

Let A = L(G/Z; a) denote the Hilbert algebra associated to the measure 
a(z)dz on G and, as in n° 31, A be the corresponding maximal Hilbert 
algebra. It is the set of u € L?(G/Z;a) =H such that the maps 


R(u): fro fu, L(u): f-ouf 


from A to H are continuous with respect to the L? norm. Besides, as was 
seen in the proof of theorem 47 of n° 31, it suffices that this holds for one of 
them. If v, € A converge in H, by (9) the functions vp,.u = vpn *z u clearly 
converge everywhere and even uniformly on G. At the limit, we thus again 
get 


(32.22) v.u= R(u)v =v *z u 
for all v € H. Replacing v by ¥, (21) shows that 
v.u(g) = (ul R(x)v) 


for allu € A andv € H. The left hand side being in L?, the proof, therefore, 
reduces to showing that € contains a non-trivial wu € A. However, € being 
right invariant, the projection P of L*(G/Z; a) onto € commutes with R(u) 
for all u € A, hence also for all u € A. Then, for u,v € A, 
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PL(v)u = PR(u)v = R(u) Pv = L(Pv)u. 


As PL(v) is continuous, so is L(Pv), which shows that Pu € A for all v € A. 
Hence the moderate elements of € are everywhere dense in €, qed. 


Exercise 1. Show that € C A. 
Exercise 2. Let (H,U) be a square integrable irreducible representation 
such that U(z) = a(z)1. Show that, if the rank of A, B € L(H) is finite, then 


| Tr |[U (a) A] Tr[U (x) Bldx = » Tr(AB*) . 
G/Z 


Associate the function ya(x) = A7? Tr{U(x)A] to each A of finite rank. 
Show that A +> vy, extends to an isomorphism from the Hilbert space 
L2(H) onto a minimal closed bi-invariant subspace M of L?(G/Z;a) and 
that every isomorphism from (H,U) onto a minimal right (or left) invariant 
closed subspace of L?(G/Z; a) maps H to M. Show that if G is compact one 
recovers the subspaces L?(G; x) of n° 29, (ii). 

Exercise 3. (a) Show that a function yp € H = L?(G/Z;a) is of positive 
type if and only if the operator L(y) : A —> H is positive symmetric. Assume 
this condition holds in what follows. (b) Let 


+00 
z= | dM (A) 


be the self-adjoint canonical extension of S = L(y) [theorem 35 of n° 23, (ii)], 
whence S = H in A. Let w be a compact set contained in R4. and M(w) the 
corresponding spectral projection [notation of n° 22, (iii)]. Show that M(w) 
belongs to the von Neumann algebra £ and that M(w)L(y) is continuous. 
Deduce that y’ = M(w)y is moderate and that 0 <« y’ « y. (c) Assume that 
y is continuous and elementary [n° 30, (iii)]. Show that y is moderate, that 
*z ~ is proportional to y and that L(y) is proportional to a projection. Let 
E = Im(L(v)). (d) Show that the representation of G on € by right trans- 
lations is equivalent to the irreducible representation associated to y [n° 30, 
(iii)]. (e) Show that all elements of € are moderate and that the function 
(R(x)ulv) is in L?(G/Z;a) for all u,v € E. Deduce the orthogonality rela- 
tions for any irreducible representation having a square integrable diagonal 
coefficient.13° 


'85 Theorem 51 has a curious history. Immediately after having read Bargmann’s 
article, I published a proof of it (CRAS, September and October 1947) which as- 
sumes Z is finite using moderate functions in L?(G) and theorem 35 of n° 23; this 
is the one proposed in exercise 3. In 1956, Harish-Chandra ((Euvres completes, I, 
pp. 90-107) generalized the result to semisimple Lie groups without any assump- 
tions on Z and supposing that a not necessarily diagonal coefficient is square 
integrable. He refers to my notes and says that my proof could perhaps be mod- 
ified so as to apply to the case of an infinite Z. His proof uses very special 
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Exercise 4 [Dixmier, C*-algebras, n° 13.8]. More generally show that if 
a function y € L?(G/Z;a) is continuous and of positive type, there is a 
function w € L?(G/Z;a) such that yp = bez w. What about the case of a 
commutative group? 


properties of semisimple groups, but his far harder aim was the explicit com- 
putation of the characters of the representations considered. Proposition (i) of 
the theorem can also be proved using the closed graph theorem; see for exam- 
ple A. Borel, Représentations de groupes localement compacts (Springer, Lecture 
Notes, 276, 1972, pp. 56-57), Dieudonné XXI.4, exercises 5 to 9 and N. Wallach, 
Real Reductive Groups (Academic Press, 1988), chap. I, which do not mention 
the inventor of the method. 
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§ 1. Infinite Series and Products in Number Theory 


1 — The Mellin Transform of a Fourier Transform 


There are more interesting relations between Mellin and Fourier transforms 
(Chap. VIII, n° 13) than those arising from a trivial variable change in the 
integration. At the very least, they apply to all functions f defined, continuous 
and integrable over R (with respect to the usual measure dx) and whose 
Fourier transforms are themselves integrable, in other words to functions f € 
F(R). The Fourier inversion formula [Chap. VII, § 6, theorem 26 or Chap. XI, 
n° 27, (iv), theorem 41] applies to these functions. Setting d*x = da/|x| as 
before, one can associate functions 


(1) 76s) = f feilaltare, FH) =f F@lel sen(ayere, 


to such a function as well as to its Fourier transform. These are in fact 
the Mellin transforms of the functions f,(%) = f(x) + f(—ax) and f_(x) = 
f(x) — f(—) and they are equal to 2I';(s) (resp. 0 ) if f is even (resp. odd) 
and conversely. As f is continuous, its convergence in the neighbourhood 
of 0 is guaranteed for Re(s) > 0; and as f is integrable with respect to 
dx, convergence at infinity is guaranteed if the function |x|*~! is bounded 
there, hence for Re(s) < 1. The functions (1) are, therefore, defined and 
holomorphic at least on the strip 


(1.2) 0 < Re(s) <1. 


If f is in the Schwartz space S(R), f, and f_ are also in S. So their Mellin 
transforms are meromorphic on C, possibly with simple poles at the points 
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n <0, and are rapidly decreasing at infinity on any vertical strip of finite 
width (Chap. VIII, §3, n° 13, theorem 14). In fact, the functions I’?(s) and 
iz 7 (s) have simple poles at even negative integers in the former case and at 
odd ones in the latter one: consider the asymptotic expansions of f, and f_ 
at the origin. 

The basic result is that, for f € F1(R), the ratios 


Tp (1— s)/T?(s) and T(1—s)/Te(s) 


are independent of f and can be explicitly calculated. This can be generalized 
to much more complicated situations than that of the very trivial field of 
real numbers and arises out! of John Tate’s Ph.D. thesis; around 1950, in 
Harvard, he used calculations of this type to study zeta functions of algebraic 
number fields in the “adelic” framework. There are also non-commutative 
generalizations, for example by replacing the field _R by the ring M,,(R) of 
real n xX n matrices, the multiplicative group R* by the group GL,,(R) of 
invertible matrices and the function « +> |z|* by g +> |det(g)|* in the 
simplest case. 
To unify calculations, it is useful to temporarily set 


(1.3) x(2) = |2|* or |a|* sgn(x) 


according to cases. Hence y(xy) = x(x)x(y), and 


(1.4) T;(x) = ) fla)x(a)d*e 


where integration is over R*. We will also set 


x*(«) = x(a) ||, 
which corresponds to the shift from s to 1— s. 
Therefore the result we have in mind means that, for all f,g € F(R), 
(1.5) D's (x") g(x) = Lg (x*) Le): 


0 < Re(s) < 1 needs to be assumed, but the result holds in all of C by analytic 
extension if f and g are in S(R) and even under broader assumptions. 
A formal calculation reduces the proof to showing that 


[freoame @xweaay=[f ef @rxtw)araay 


The change of variable y ++ xy in the integration with respect to y trans- 
forms relation (*) into 


Jf roa )x(ypaeary = ff o(x)f (y)ded*y. 


1 Like the method used in § 3 for the series L(s; x). 
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So showing that 
(**) if f(a)g(ey)de = i ola) fey) de 


is sufficient, but we have known this relation for a long time.” 

These operations remain to be justified. Since, for 0 < Re(s) < 1, 
f(x)x*(a) and g(x)x(a) are integrable with respect to d*x, the product 
f(x)x*(x)G(y) x(y) is integrable with respect to the product measure d*xd*y, 
and so is the function obtained by permuting f and g. Hence it is possible 
to compute both sides of (*) by first integrating with respect to y for given 
x, which entitles us to make the variable change y +—> xy, then with respect 
to «: Lebesgue-Fubini theorem. 


Relation (5) being proved, the ratios I’? (1 —s)/T(s) and Py(1—s)/T5(s) 
remain to be explicitly calculated assuming f even in the former case, odd in 
the latter one. The same then holds for 7 and the functions I?, etc. reduce 
to I’, up to a factor of 2. The easiest is to choose a particular function g 
whose Fourier transform is known. 

In the first case, we can for example choose 


(1.6) g(a) = (1+ a?) , and so g(y) = mexp (—2z|y|) 


by (8.12) of Chap. VIII for w = 1. Here it is necessary to assume 0 < Re(s) < 
1 in order to get convergent integrals in the formulas 


T3(s) = 20 ae exp(—27a)a*d* ax. 
The first integral is just function (15.2) of Chap. VIII for 1— s. So by VIII, 
(15.10), 
(1.7) I} (1 — s) = 1/cos(1s/2) . 
The change of variable x +> x/27 shows that on the other hand 
19(s) = (2n)!-*F(s) 


by definition of the I function. For this choice of g and for y(a) = |2|*, 
f € F', relation (5) becomes 


SATE) 7 (8) 


(1.8) (20)'*I'(s) Fe (1 — 8) = 


? Relation (**) not being exact if integration is restricted to be over x > 0, this 
calculation shows that in the definition of Mellin transforms used here it is nec- 
essary to integrate over R* and not simpy over R4 as in Chap. VIII. 
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If f is even, I’? and I ? can be replaced in the formula by the Mellin transforms 
I's and I iz This result assumes 0 < Re(s) < 1 but by analytic extension 


remains valid if the Mellin transforms of f and f are meromorphic on C, for 
example if f € S(R). When one obtains such a formula, it is prudent to at 
least check that both sides have the same poles. 

We could have also chosen the function 


(1.9) o() = exp (—na2) = 6(2) 
(Chap. V, n° 25, example 2 and Chap. VII, n° 28). In this case, the change 


of variable rx? = y for which d*x = $d*y and « = (y/m)2 gives 


P(x) = 7 et aS d* x = io en (y/n)s? dty = 17 8/?T'(s/2). 
Formula (5) then becomes 
(1.10) 9 w8/?D(s/2)P(1-s) =a O-PP [1 — 8) /2] re(s). 
Applying it to function (6), one deduces that 
a—*!2P'(s/2)n/ cos(ms/2) = 1~9-9)/2P [(1 — 8) /2| (2m) -8T(s). 
As relation ['(s)I(1 — s) = 1/sinzs shows that 
n/cos(ws/2) =I [(1— s) /2]P[(1 +s) /2] , 
(10) is equivalent to 
(1.11) I'(s) = 17 225"1(s/2)F [(1 +8) /2] . 


This result is itself equivalent to the duplication formula 


of Chap. VIII, (10.5.9) and we thereby get a more natural proof of it. 

Relation (10) has the benefit of being perfectly symmetric in s and 1 — s. 
Multiplying both sides by I" (5 +s/ 2) and taking account of the duplication 
formula and Euler’s relation, it can (exercise!) be expressed in the simpler 
form 


(1.10’) I? (1 —s) = 2(2n)*' sin(rs/2) (1 — s) Ie (s) ; 
In the case of I" 7 (8), choose for example 


g(x) = xexp (—m2”) = —g(—2). 
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This is more or less the derivative of the Gaussian function, so that 


9(y) = —iy exp (—7y”) = —ig(y) - 
Thus 


+oo 
Ps) = ‘f exp (—2?) |a|Stld*a = a +)/271(1 + 8) /2] . 


As I; = —iIj, this time 
(1.12) -in GDP ((1+s)/r1-s) =r? 9P 1 — s/2)P3(s) 
or 
P (1+ s)/2 Pp — 8) = im? 2 P(1 — 8/2) 2(s). 
Multiplying by '[(1 — s)/2], by the relation on complements and (11), 
Tp (1-8) = én**/? cos(ns/2)P(1 — s\n? BT H(s). 
Thus finally the formula 


(1.127) I; (1 — s) = 2i(2m)** cos(ms/2) (1 — s)I';(s) 

similar to (10’). The poles s = 1,3,... of ['(1 — s) are canceled out by the 
zeros of cos(7s/2), so that the left hand side has poles at s = 2,4,... as it 
should since f is odd. 

Exercise. Let S’ be the space of tempered distributions on R (Chap. VII, 
§6, n° 32). (a) Find a reasonable definition of meromorphic functions with 
values in S’. (b) For non-negative integers s, define the distributions |2|*d* x 
and |a|* sgn(a)d*x and compute their Fourier transforms. (c) Regarding s —> 
|x|*d*x as a meromorphic function with values in S’, calculate its residues at 
0,-1,... 

Finally, we could choose the function 

f(z) =1/ chaz = f(x) 
of Chap. VIII, n° 15, but the outcome would be quite different. It is obviously 
in the Schwartz space. As it is even, it suffices to calculate 


+oo sq* —T@ 7,8 J* 
ry(s) =2 f ee 
0 


eT 4 e-Tu 1 +4 e72ne 
= 2 fematde (aye = 
= 23-0" fecerrratate = 
= 2n-*I'(s) 5 (-1)"/(2n+ 1). 


N 
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TL 


The first series expansion is justified since e < 1, and so is the permutation 


with the integral for Re(s) > 1 because 


> f [core eros 


(Chap. XI, n° 4, theorem 5, a result that will be constantly used in this type 
of calculations). 
The series obtained can also be written 


(1.13) L(s,x) = ¥-(-1)"/(2n +18 = © x(n) 


where the “character mod 4” x, unrelated to functions (3), is given by 


d*x < +00 


1 if n=1 (mod 4) 
(1.14) x(n) = 0 if n=0 or 2 (mod 4) 
—1 if n=3 (mod4). 


It resembles the Riemann series ¢(s) = }>1/n° . Relation 
(1.15) I;(s) = 20(s)m *L(s; x) 


shows that the function L(s; x) = I'(s)/2I'(s)a* extends analytically to all 
of C like I'y(s), which has simple poles at the points s = —1,—2,—5,... As 
1/I'(s) is an entire function with zeros at these values of s, the series L(s) 
is the restriction of an entire function to the half-plane Re(s) > 1. And as 
2r, = TP = Fy since f is even and identical to its Fourier transform, (10) 


leads to a functional equation for L(s), which can, as usual, be simplified by 
some gamma function calculations: the function 


(1.16) (1/4) OF 9 [(s + 1)/2] L(s;x) 


is invariant under s +~ 1 — s. This result will be generalized in §3, n° 10. 


2 — The Functional Equation of the ¢ Function 


This topic has already been addressed in Chap. VIII, n° 18, but let us present 
it again for the convenience of the reader. We start with the Poisson summa- 
tion formula > f(n) = SD f(n), which is at least valid for all f € S(R), and 
apply it to the function y+ f(x~1y) for given x 4 0. A trivial calculation 


then shows that 


Hence, setting 


(2.1) 0; (x) = So f(ne) = 6;(—2), 


n#0 
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(2.2) f(0) +63 (~*) = f(0)|e| + [2/64 (2). 


We are going to deduce a relation between Mellin transforms of 6} and of OF, 
but first their behaviour at the origin and at infinity need to be studied. 


Lemma 1. For all f € S(R), the function 0% is C°° on R* and rapidly 
decreasing at infinity and so are all its derivatives. 


As, for all N €N, there is an upper bound |x f(a)| < ey valid in all of 
R [the left hand side is continuous on R and bounded for large |a|, hence on 
R], it follows that, for N > 2, 


[9% (a)| < en 55 |na|-% = cyl|a|-%. 


Hence it is rapidly decreasing at infinity. To show that 6; is C™, it suf 
fices to prove that the derived series 7 n? f) (na) = a~? 7 gp(nx), where 
gp(x) = a? f) (x), converge uniformly in every compact subset of R*, which 
is obvious since g, € S(R). The previous calculation then shows that the 
first derived series of 0; (x) and hence, by induction, the following ones, are 
rapidly decreasing at infinity, qed. 

Formula (2) now gives the behaviour of 6} in the neighbourhood of x = 0, 
namely 


97(x) = f(0)|2|-* — f(0) + O (al) 


for all N by lemma 1. This can be written as an unlimited asymptotic expan- 
sion whose only non-trivial trivial terms are the first two. Hence the methods 
and results of Chap. VIII, n° 13 can be applied to the Mellin transform 


+00 
(2.3) é¢(s) = | O;(x)x*d*z, Re(s) >1. 


It is holomorphic on the indicated half-plane; it extends to a meromorphic 
function on C whose only singularities are simple poles at s = 1 and s = 0, 
the corresponding residues being 


(2.4) Res (€7,0) =—f(0), Res (€p,1) = f(0); 


finally €¢(s) and all its derivatives are rapidly decreasing at infinity on any 
vertical strip of finite width. 
Moreover, we have 


(2.5) Er(s) =€-(1— 8). 


To see this, let us consider the entire function 


+00 
(2.6) EF (s) - ih O;(x)ard*x. 
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For Re(s) > 1, by (2), 


all the integrals being convergent. Hence finally 


(2.7) £7(s) = &} (s) era s) [20 io 


Ss l-s 


As f(x) = f(—a), replacing f by f and s by 1 — s clearly permutes the first 
two and the last two terms on the right hand side, whence (5). Relation (7) 
confirms that €(s) extends analytically to a meromorphic function with at 
most simple poles at 0 and 1. 

However, for Re(s) > 1, 


j= f- fira)lattee= Sf = 


n#0 n>1 
pos “ft )arlPd*ar = F9(s)C(s), 


where I" ? is defined by (1.1) and where ¢(s) = 5 1/n® is the Riemann func- 
tion. To justify these formal calculations, showing 


Xf lelne)) esl are < +00 


is sufficient, but this is clear for Re(s) > 1. 
This being proved, relation (7) becomes 


(2.8) T?(s)¢(s) = Fe(1— s)¢(1 — s) 


provided both sides are defined by analytic extension since without relation 
(7), the left hand side would suppose that Re(s) > 1 and the right hand one 
that Re(s) < 0. 
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Theorem 1. The function 
(2.9) €(s) = 1-9/7 (s/2)¢(8), Re(s) > 1 


extends analytically to a meromorphic function on C whose only singularities 
are simple poles at s = 1 and s = 0, where its residues are equal to 1 and 
—1. It satisfies the functional equation 


(2.10) é(1—s) =€(s). 


The ¢€ function extends analytically to all of C, except for a simple pole at 
s =1 where 


(2.11) Res(¢,1) = 1. 


¢(s) =0 for s = —2,—4,.... 


The assertions about €(s) follow from properties of €; by choosing 
f(x) = exp(—mx?) = f(a), whence I';(s) = 427*/?1'(s/2). On the other 
hand, thanks for example to the infinite product expansion of Euler’s func- 
tion, we know that 1/I°(s) is an entire function of s. We also know that the 
only singularities of ’(s) simple poles at points s = 0,—1,—2,..., which are, 
therefore, simple zeros of 1/I’(s). In 


8/2 


G(s) = Fea Sl)> 


the simple pole of €(s) at s = 0 is canceled out by the simple zero of I'(s/2) 
at this point, so that ¢(0) 4 0. The fact that ¢(s) = 0 for s = —2,—4,... 
follows from the analogous property of I'(s/2). At s = 1, '(s/2) = 2, and 
as € has a simple pole with residue 1 at this point, in the neighbourhood of 
s=1, 


¢(s) = [142(s—1) +.. ][I/(e—1)+..]. 


Hence the existence of a simple pole and relation (11), qed. 
Exercise 1. Show that 


C(1 — 2n) = —bo,/2n 


for n = 1,2,..., where the bz, are the Bernoulli numbers of Chap. VI, n° 12. 
[Use functional equation (10), relation (13.7) 


2¢(2n) = (—1)"*!(2m)?"ban//(2n)! 


of Chap. VI, formula 
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and the duplication formula of the I function]. 
Exercise 2 (Riemann). Show that I'(s)¢(s) is the Mellin transform of the 
function 1/(e’ — 1) and that 


(0+) 


2ni¢(s)/(1 — s) =| (e77 — 1)" ede 


—Co 


for all s € C, where integration is over the same path as for the Hankel 
integral for the I’ function (Chap. VIII, n° 10, (iii)). What is the connection 
with previous exercise? 

As all authors of textbooks on analytic function theory remind their read- 
ers, Riemann, the first to have proved theorem 1 using the Jacobi function® 


O(ix) = os exp (—1n?z) , 


conjectured in 1859 that the only non-trivial zeros (i.e. other than —2, —4,...) 
of his function are located on the line Re(s) = 5. For more than a century, 
many eminent mathematicians have made significant efforts to prove this. 
Amateurs, who have written thousands of wrong proofs of Fermat’s theorem, 
do not tackle Riemann’s hypothesis — too technical, whereas Fermat’s theo- 
rem can be regarded as elementary arithmetic ...—, neither do professional 
lacking unlimited confidence in their own ability.* For a long time, it was 
hoped standard methods of analytic function theory would work. They have 
led to significant results, but not to the result. For example, G. H. Hardy 
showed in 1914 that the ¢ function has infinitely many zeros on the “critical 
line” Re(s) = 3, which could explain his comments on the intellectual level 
of mathematics applied to artillery and his conviction that his mathematics 
will not be useful as it neither tends to accentuate the existing inequalities in 
the distribution of wealth, [n]or promotes directly the destruction of human 


life. 


See my Postface to vol. II. In an article on mathematics in Le Monde on 
May 25th, 2000, exalting applications without any discriminations, by a curious 
coincidence, I found the following passage: 


Thus, one should reflect on the unfortunate comments of the English 
mathematician Godfrey Hardy, who considered that real mathematics 
would never have any military applications. If only because, through cryp- 
tography, the theory of prime numbers plays an essential role in the world 
of intelligence and in that of the internet. 


These comments seem to me to be far more “unfortunate” than Hardy’s dec- 
laration. 


3 Generally speaking, we set 0(z) = )> exp(ain?z) for Im(z) > 0. 

“In his youth, André Weil hoped to prove it before its centenary date and to 
publish it in 1959. He burst out laughing the day when, after the publication 
of his complete works, I remarked to him that if he finally found a fifteen page 
proof, Springer-Verlag would be obliged to add a very thin volume to its edition. 


5 


io) 


NI 
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(1) Though Le Monde twice dedicated two pages to “the year of mathe- 
matics”, the cited passage is the one and unique reference to their military 
use. Is this topic so irrelevant — or so dangerous? — that it is only worth five 
ironic lines? 

(2) M. Augereau forgets to mention the word “directly” that Hardy had 
prudently employed, which enables him, as well as most commentators, to 
get away with ridiculing him. When Hardy wrote this in 1915, the chemistry 
of toxic gases (chlorine, phosgene, mustard gas, etc.) was starting to play a 
direct role in warfare thanks to future German Nobel laureates (Fritz Haber, 
James Franck, Otto Hahn, Gustav Hertz) and to dozens of less well-known 
chemists, who were readily imitated by their French and British colleagues.” 
Haber’s initiative caused an enormous scandal; its author, fearing he would 
be judged by the Allies, temporarily fled to Switzerland in 1918. In 1919, 
he received the chemistry Nobel prize for his pre-1914 research on the direct 
synthesis of ammonia, which was doubly scandalizing since the first use of the 
new technique, developed by BASF in 1912, saved Germany from an armament 
crisis® that would have been fatal for the country long before 1918. To this 
day, public opinion continues to disapprove the use of these armaments, gas 
warfare having given rise after 1918 to the first “anti-Science” comments in 
some circles. Grass-root veterans of my youth, knowing very well that they and 
their German counterparts had been in the same drunken boat together, did 
not feel any hatred for them; but according to them, Haber should have been 
shot in 1918. 

People who ridicule Hardy should learn about the context. Haber’s chem- 
istry is altogether a different matter from using the Arabs’ trigonometry to 
locate enemy batteries or inventing a cryptography system allowing the mili- 
tary of all nations to protect their little secrets... 

(3) That the new cryptography is of interest to the “world of intelligence”, 
which I mentioned (Chap. V, § 5, note 36) without waiting for M. Augereau and 
the mathematicians he interviewed, is in no way obvious. Upon its invention 
by Diffie and Hellman in 1976, the principle of public-key cryptography was 
made. .. public,” after which Rivest, Samir and Adelman invented and they too 
made public® their system based on the factorization of very large integers; it 
was commercialized by a company, RSA Systems, which today is worth billions 
of dollars on Wall Street. It stands to reason that a cryptography system can 
really be of use to a government agency — in this case, the American National 
Security Agency — only if it has exclusive rights to it, particularly concerning 
an almost indecipherable system. Making it available for everyone was a first 
in the history of the subject, and from the point of view of the NSA, was ex- 
tremely absurd since the Soviets, who were not “bad at Maths”, could use it. 
All this was done despite opposition from the NSA and a 1951 law that would 


‘ 


The main source for the topic is L. F. Haber, The Poisonous Cloud (Oxford UP, 


1986), whose author, the son of the chemist, is a specialist of the history of the 
chemical industry. 

After the war, the new technique made it possible to introduce large quantities of 
chemical nitrogen fertilizers — this was in theory BASF’s original purpose — and 
to almost multiply the world population by four in one century. Chemists have 
“revolutionized society” long before von Neumann and computer scientits. Va- 
clav Smil, Enriching the Earth. Fritz Haber, Carl Bosch, and the Transformation 
of World Food Production (MIT Press, 2001). 

Including in France, thanks to the translation (Pour la Science, October 1979) 
of one of Hellman’s article in Scientific American. 

Communications of the ACM, 21, 1978, pp. 120-126. 
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have allowed it to “classify” these new methods,’ obstacles which resistance 
from scientific circles and the enormous power of business successfully over- 
came. As recently written by an American journalist, the outcome!® is that 
the efficiency of the NSA, submerged under a flow of messages especially due 
to the development of the internet, has significantly diminished, the process 
being heightened by the development of fibre optics. 

(4) If Hardy must be contradicted, it would have been better to remark 
that making the new methods of cryptography available to international trade 
and money launderers is not such as to reduce existing inequalities in the dis- 
tribution of wealth. 


Computers have made it possible to calculate more than a billion zeros of the ¢ 
function, all on the critical line, i.e. whose real part is 0,5000... with a very large 
but finite number of zeros. This keeps calculators busy and were it needed, would 
reassure mathematicians; but it does not prove anything. Everyone has always 
believed in Fermat’s theorem, but the problem was proving it, not checking it 
numerically. This proved to be tremendously more interesting than the statement 
itself. This will obviously also be the case of the Riemann hypothesis and it is 
mainly for this reason that it fascinates those who hope to be able to justify it. 

Similarities between the Riemann function and series associated to algebraic 
curves have now convinced specialists that methods of algebraic geometry will lead 
to the proof, as was the case with Ramanujan’s conjecture (Chap. IV, §3, n° 20) 
and with Fermat’s theorem. It may well be so. Others consider combining them 
to those of non-commutative harmonic analysis, i.e. to generalizations of Fourier 
transforms to non-commutative groups like SL2(R). In such cases, the best is to do 
like the English: wait and see. 

Riemann showed that his hypothesis, whose statement is at a first glance anec- 
dotal, was connected to the statistical distribution of prime numbers. Using prime 
number and logarithm tables, Gauss and Legendre had conjectured that, if (zx) 
denotes the number of prime numbers p < x, then 


(2.12) a(x) ~ x/logx 


and what is even somewhat better, 
(2.13) n(x) ~ li(a) = / dt/logt (“logarithmic integral”) . 
2 


Exercise. Show that 
li(x) © x/logx — x/log?2+a/log?a—... when « —> +00. 


° Among the articles I collected at the time, I find Duncan Campbell, Whose eyes 
on secret data? (New Scientist, 2 March 1978), Cryptology: A Secret Meeting at 
IDA? (Science, 200, 14 April 1978), Intelligence Agency Chief Seeks “Dialogue ” 
with Academics (Science, 202, 27 October 1978), Prior Restraints on Cryptog- 
raphy Considered (Science, 208, 27 June 1980), Cryptography: A New Clash Be- 
tween Academic Freedom and National Security? (Science, 209, 29 aotit 1980), 
High Technology: Back in the Bottle? (Technology Review, aotit/septembre 
1981), Christopher Paine, Admiral Inman’s tidal wave (The Bulletin of the 
Atomic Scientists, March 1982). 

10 Seymour M. Hersh, The Intelligence Gap, The New Yorker, 6 December 1999, 
by the author of a famous book on the “My Lai massacre” during the Vietnam 
war. 
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In 1896 Jacques Hadamard and Charles de la Vallée-Poussin independently obtained 
proofs of these results,'! related to the non-existence of zeros of ¢(s) on the line 
Re(s) = 1, and they have been much improved since. But the Riemann hypothesis 
is equivalent to 


a(x) =li(x) +O (x2**) for alle > 0, 


a far stronger result than the previous one. With their probabilistic conception of 
real mathematics, obviously, cryptography experts do not hesitate to make use of 
it. 


3 — Weil’s method for the Function n(z) 


In Chap. IV, §3 of n° 20 entitled“ strange identities”, we introduced the function 


(3.1) n(z) = e"™/? II (1 — exp(2minz)) = q'/” II (1- gees 


n>1 
where q is set to be 
q = exp(miz), q: = exp(mis) 


for Im(z) > 0 and s € C. These are simple abbreviations, the variable still being 
z and not qg. As was seen then, the infinite product is absolutely convergent for 
Im(z) > 0 since |g] < 1. Moreover, since the series )> exp(27inz) = > e(nz) is 
dominated in the half-plane y > r by the convergent series 5+ exp(—2ar)” for 
all r > 0, the infinite product is normally convergent in such a half-plane. So 
function (1) is holomorphic on the half-plane Im(z) > 0 (Chap. VI, §4, n° 20, 
theorem 18). In Chap. IV, we stated the two functional equations satisfied by the 
7 function. We are now going to show how the Mellin transform simplifies their 
proof using a method ' which amounts to reversing the arguments leading to the 
functional equation of the zeta function; this is a great exercise. A proof using the 
series expansion 


n(z) = exp(miz/12) § > (—1)” exp [n(n + 3) riz] 


and the Poisson summation formula will be given later (§3, n° 7). 


Theorem 2 (Dedekind). The following relations hold: 

(3.2) n(z +1) = exp(mi/12)n(2), 
ne 

(3.2”) m(—1/z) = (2/1)? n(z). 


The first relation is obvious. By analytic extensions, it is sufficient to prove 
the second one for purely imaginary z = iy with y > 0. The 7 function, being 


11 See for example Freitag & Busam, Chap. VII, §6, or the last chapter in Serge 
Lang’s Complex Analysis (Springer-Verlag, 4th. ed., 1999). All that was known 
before 1978 can be found in K. Prachar, Primzahlverteilung (2. Aufl., Springer- 
Verlag). 

2 André Weil, Sur une formule classique (J. Math. Soc. Japan, vol. 20, 1967, 
pp. 400-403). This type of method was in fact invented in 1936 by the great 
expert of the theory of modular functions, Erich Hecke. We could even go back 
to Riemann, with his proof of the functional equation of the ¢(s) function. 
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represented by an absolutely convergent product all of whose terms are non-zero, 
has no zeros (Chap. IV, §3, n° 17, theorem 13) and for z = iy, all its terms are 
> 0. Hence its logarithm may be considered. It is somewhat simpler to study the 
function 


f(y) = —ry/12 — log n(iy) = — Slog (1 — exp(—27y)”) = 


(3.3) = ys exp(—27y)'"""/m, 


m,n>1 
the logarithmic series being justified since exp(—27y) < 1. By the general theorems 
(Chap. II, §2, n° 15 and 18), the double series (3) with positive terms obviously 


converges unconditionally. So the terms can be arbitrarily arranged (associativity). 
The Mellin transform of f is 


Py(s) = 9s) = | y'd'y Sy m™ exp(-2nmny) = 
= Dy fm exp(—2rmny)y*d*y = 
=n) Le) i on 


whence 
(3.4) p(s) = (2m) *I'(s)¢(s)¢(s + 1). 


This formal calculation is justified by the convergence of the series whose general 
term is 


if |m~* exp(—2rmny)y*| d*y = (2x) °P'(a)/m7**n? , 


where o = Re(s), i.e. the product of two absolutely convergent series for Re(s) > 1. 
As the ¢ function has a rather remarkable functional equation, namely 


(3.5) E(1—s) =£(s) where €(s) =m */7I(s/2)¢(s), 
p(s) needs to be compared to the function 
E(s)€(s +1 =r 2 T(s/2)P [(s + V/A C(8)C(s + 1), 


However, the duplication formula (1.11) of the gamma function 


3.6) I'(s/2)P ((s + 1)/2) = 122! °I(s) 
shows that 

3.7) &(s)E(s + 1) = 2(27) *I'(s)¢(s)¢(s +1). 
Hence 

3.8) pls) = 5€(s)E(s +1), whence o(s) = o(-s). 


Since €(s) is meromorphic and its only singularities are simple poles at s = 0 and 
s = 1, y(s) extends analytically to all of C, except for simple poles at 1 and —1 
and a double pole at s = 0 with residues 
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Gay ‘Reedy 562) Res(é,1) = x2) = am PC) =n/12 


since ¢(2) = 17/6, and so, by symmetry, 
(3.9’) Res(y, —1) = —7/12. 


In the neighbourhood of s = 0, there are series expansions 


€(s) =—-1/steotcs+..., 
(s+ 1) = &(-s) =1/st+co—ast+..., 


whence 


(3.9”) y(s) = —1/28” +a0 +428" +... 


whose coefficients are of no relevance for the rest of this book. 
Let us now show that the Mellin inversion formula 


(3.10) arifty) = feta dy 
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can be applied to f. Indeed, the function t +> y(o + it) is the Fourier transform 
of the integrable function u +--+ flexp(27u)]exp(270u). But as €(s) is rapidly 


1 


decreasing at infinity on every vertical, so is y(s) = 5&(s)€(s + 1). The Fourier 
inversion formula then proves (10) as in the proof of theorem 14 of Chap. VIII, 


n° 13. 


To exploit relation y(—s) = y(s), (10) needs to be compared with the similar 
integral along the vertical Re(s) = —o, i.e. the residue formula needs to be applied 
to the contour bounded by verticals o and —o and horizontals T and —T. At the 


limit, these do not contribute to the integral since |y~*| = y 7 


is independent of 


T, whereas y(o + iT) is rapidly decreasing. On the other hand, the integral over 


Re(s) = —o reduces to an integral over Re(s) = o by s+ —s. Setting 


therefore, gives 


ni Rest) =f 9, HOC ds = 


= / [y°y(s) — y°y(s)] ds = 
Re(s)=o 
(3.11) = 2nilf(y) -— fU/y)] , 


whence 


f(1/y) = f(y) — >= Res(). 


The residues of w(s) = y(s)y~° at poles 1, 0 and —1 remain to be calculated. As 1 


and —1 are simple poles of y, by (9) and (9’) 
Res(w, 1) = 7/12y, Res(w, —1) = —my/12. 
In the neighbourhood of 0, by (9”) 
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p(s) = —1/2s*+a0+..., 
y ° =exp(—slogy) =1-—slogy+..., 


whence 
Ree) Slosy. 
So 
F(/y) = fly) — 2/12y + my/12 = 5 logy, 
(3.12) f(y) + xy 1/12 = Fly) + ry/12— 5 log y. 


Since, by (1) and (3), n(¢y) = exp[—f(y) — my/12], it follows that 


; dy 
n(i/y) = y? n(iy) - 
Thus, by analytic extension, we get the strange identity 
ack 
m(—1/z) = (2/4)? n(z) 
that had been prematurely announced at the end of Chap. IV, §3 in the hope of 


inspiring new Ramanujans. 
Exercise. Let w be a complex number such that |w| <1, w 4 1. Set 


f(y) = —log (1- we™*) = Southey: 


Show that f satisfies the assumptions of Chap. VIII, n° 13, theorem 14 and deduce 
that the function 


I(s) Ss w" /ne* 
extends analytically to C. 


Several infinite products similar to the Dedekind function are encountered in 
the theory of elliptic or modular functions. Like the latter, they all have simple 
functional equations that can be proved in the same way. 

Exercise 1: functional equation of F(z) = q7/4]](1 + @"~"). The Mellin 
transform of 


(3.13) f(y) = my/24 — log F(iy) = SO(-)™e On "4 Im 
is given by 

(s)/m *I(s) = S\(-1)"/(2n — 1)°m5*? = 
(3.14) = Sey ya e 1/(Qn—1)°. 
Using (7) and 


5 1/@n — 1)° = ¢(s) — $7 1/(2n)* = (1-27) G(s), 
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one deduces that 
g(s) =m *I(s)¢(s)¢(s +1) (1— 27°) (2° -Y = 
(3.15) = —€(s)€(s +1) (1— 27°) (1— 2°) = y(-s). 


Like €, the function y is rapidly decreasing on every vertical strip since the functions 
2° and 2”* are bounded there. The function 


(3.16) %(s) = o(s)y* = 5&(s)&(s + 1) (1— 2°) (1-2) y™® 


being integrated is, like y, rapidly decreasing on all vertical strips since the factor 


y ° is at most of polynomial growth. Hence the Mellin inversion formula applies, 


and calculating as in (11) gives 


fQ/y) = ) — $5 Res(ib) 


the residues being with respect to poles contained between verticals o > 1 and —o. 
But (24) shows that, like the function €(s)&(s+1), ~ has at most poles at points 1, 0 
and —1. The corresponding residues remain to be calculated. 

As the poles of $€(s)(s +1) at 1 and —1 are simple, it suffices to multiply the 
residues by the values of the function (1 — 2°)(1 — 2~*)y~*° at 1 and —1, ie. by 
—1/2y at s=1 and —y/2 at s = —1, whence 


(3.17) Res(w, 1) = —7/24y, Res(w, —1) = wy/24. 


The function 4€(s)€(s + 1) has a double pole at s = 0. It is canceled out by the 


double zero of the factor (1 — 2°)(1— 27°), thus the corresponding residue is trivial. 
So finally, 


(3.18) fQ/y) = fly) — wy/24 + 1 /24y. 


One, therefore, concludes that the function 


F(z)=q'/*4 II (1+ ge) where g = exp(7iz) 
satisfies the functional equation 
(3.19) F(-1/z) = F(z) 
and trivially, 
F(z+2)=q /?F(z). 


Exercise 2: functional equation of G(z) = q7/*4]](1 — ¢?"~1). The Mellin 
transform of 


f(y) = my/24 — log Gliy) = Soe OPV /m 
p(s) = mE (8) ¥o 1/m*** (an — 1)° = (2° — 1) (2m)~* P(s)6(8)6(s + 1) = 


= 5 (2-1) €(s)&8 +), 
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from which one concludes that 


y(-s) = —2°“¢(s). 


The residues of the function o(s) = y~°¢(s) are 7/12y at s = 1, —4log2 at s=0 
[since 2° = exp(slog2) = 1+ slog2+...] and ry/24 at s = —1. Here method (11) 
gives 


1 
Fly) + f(2/y) = m/12y + my/24 — 5 log2, 
and so, replacing y with 2y, 
(3.20) G(2z)G(-1/z) = V2 


by analytic extension. We could have deduced this result from the functional equa- 
tion of the Dedekind function by observing that 


G(2z) = n(z)/n(2z). 


Exercise 3: functional equation of H(z) = q'/*4T] (1 — (—q)"). We start from 


f(y) = —y/24 — log H(iy) = S0(-1)™" exp(—mmny)/m  (y > 0). 


A few calculations then show that the Mellin transform of this function is 
—s mn s s 1 
os) =m *1(s) S3(-1)""/m**'n® = 5€(s)E(s + 1a(s)a(—s), 


where a(s) = 2°/? + 1 — 2-8/2. So it is invariant under s » —s. Calculating the 
residues and using the inverse Mellin transform like in previous exercises lead to 
the functional equation 


H(—1/z) = (2/i)? H(z). 
Observing that 
n (1/2 + 2/2) = exp (11/24) H(z), 


the relation obtained means that the n(z) function satisfies 


(3.21) n(1/2—1/2z) = (z/i)?n (1/2 + 2/2) . 


With a few lines of calculations, one can deduce this relation from the functional 
equations of 7(z): 


n ((z — 1)/2z) = exp n(—(z+1)/2z) = 

22/i(z + 1))]? n[2z/(z+ 1)] = 

et 1)]?2 n[2—-2/(2+1)] = 
[-2/(z+ 1)] = 

(z + 1)/2i]? 7 [(z +.1)/2] . 


eS 


This proves (21) provided 


exp(mi/A) [2z/i(z + 1)]? [(z + 1)/2i]? = (z/) 


NR 
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holds. Despite appearances, this is not obvious’? since, by (14.12) of Chap. VIII, 


Xo 
(*) (wiw2)? = w2w2 <> |Arg (wi) + Arg (wa)| <7, 


which excludes formal calculations available for integer exponents. However, like 
z, the points (z + 1)/2 and 2z/(z +1) are in the upper half-plane, so that their 
arguments are between 0 and 7. The points 


wy = 2z/i(z +1) = —2iz/(z+1) and we = (z24+1)/2i = -i(z+1)/2 
occurring in (15) are, therefore, in the half-plane Re(w) > 0. So | Arg(w1)| < 1/2 
and | Arg(we)| < 2/2. Thus, condition (*) holds. Hence 
[22/i(z + 1)]2 [(z +1)/2i]? = (-2)2 


as expected. But as Arg(—z) € ]—7,0[, Arg(z/t) = 2/2 + Arg(—z) has to be 
chosen in such a way that it stays in the interval |—7,+7][, whence (z/i)2 = 
(—z)2 exp(mi/4), ged. 

Exercise 4: functional equation of P(z) = q'/74T](1 + q2”). This time we set 


f(y)= log] | (1 + ee) = So(-1)""? exp(—2amny) /m, 


and so, for the Mellin transform 
os) = (20) *P (8) S(-)"1 /metn® = 5 (1-27) &(s)e(s +1) 


and y(—s) = —2°y(s). The residues of y(s)y° are 7/48y at s = 1, 0 at s = 0 and 
my/24 at s = —1. The Mellin inversion then shows that 


f(y) + fQ/2y) = /48y + ry /24, 
whence 


(3.22) P(z)P(-1/2z) =1. 


13 More generally, if a,b,c,d are integers such that ad — bc = 1, setting 7(z) = 
(az + b)/(cz + d), we get 


nla) = ey) (cz +4)? n(2), 


but the explicit calculation of the constant €() encounters arithmetical problems 
which, however “elementary”, were not solved before 1931 (Hans Rademacher) 
or, with a simpler formula, 1954 (Hans Petersson), despite the fact that the 7 
function was known for a long time. See for example M. I. Knopp, Modular Func- 
tions in Analytic Number Theory (Markham, 1970). A similar problem arises for 
the @(z) function. It was solved by Erich Hecke in 1944; see for example Neal 
Koblitz, Introduction to Elliptic Curves and Modular Functions (Springer-Verlag, 
2d ed., 1993), pp. 148-152. 
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To conclude this n°, let us note the similarity between 


n(z) = gee II (1 = q”) 
and the function 


TL Q- 2") = Spl?” =P ney? 


of partition theory (Chap. IV, §3, n° 20; the variable q of Chap. IV has been 
replaced by q”). A detailed study of the behaviour of 7(z) in the neighbourhood of 
the real axis leads to precise information about the asymptotic behaviour of p(n) 
for large n, for example the asymptotic formula 


1 1 
n)~ ——~e 2n/3) 2 ‘ 
P(r) ~ Ta yT ex? [(2n/3)? x 
due to Hardy and Ramanujan, then an expansion of p(n) into a convergent series 
due to Hans Rademacher, the author of what still is one of the most beautiful books 


on analytic number theory.'* 


14 Dieudonné gives, obviously without proofs, a very very good idea of the sub- 
ject in the Dictionnaire des Mathématiques, algébre, analyse, géométrie (Paris, 
Encyclopaedia Universalis and Albin Michel, 1997) and, probably wising to give 
educated populations an idea of the power of mathematics, writes the 27 digits 
of p(721). Let us also draw attention to the excellent article by Jean-Luc Verley 
on analytic functions in the same volume. Except for some details, the articles 
of the Dictionnaire can be found in the many volumes of the Encyclopaedia 
Universalis. 
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4 — The series )* 1/ cos anz 


We showed in Chap. VIII, n° 15 that the function 1/ch7z is equal to its Fourier 
transform. A change of variable then shows that, for all t > 0, the Fourier transform 
of a ++ 1/ch(7a/t) is c +> t/ch(rat). This function being in the Schwartz space 
S(R), the Poisson summation formula can be applied to it; thus 


(4.1’) y(1/t) = ty(t) where y(t) = S> 1/ ch(rnt) 


and more generally, 


(4.1”) > 1/ch [x(u+ n)t~*] = iy. e(nu)/ ch(rnt) 


for allu€ R. 
Let us now consider the analogous series 


(4.2) f(z) == 1/cos(rnz). 


We begin by showing that it converges normally in all half-planes of the form 
Im(z) > r > 0. Indeed, in this half-plane 


2 |cos(rnz)| = et (ny—inz) 4 e™(—nyting) 


2 


n\|n|r —n|n|r 
e 


> 


> =e > etlnlr _ 4. 


The convergent series 3> 1/(e7!"!” — 1) thus dominates series (2) in the half-plane 
considered. 
We now now that f satisfies two simple functional equations. First, 


(4.3) f(z+2) = f(z). 


On the other hand, function (2) is holomorphic on the half-plane Im(z) > 0 and 
reduces to y(t) for z = it. Relation (1’) meaning that 


(4.4’) f(-1/2) = (2/t).-F() 


for imaginary z and both sides being analytic on the half-plane Im(z) > 0, (4’) holds 
on the latter. 
Similarly, relation (1”) shows that, for u € R, 


(4.4”) $5 1/ cos [-m(u + n)/2] = (z/i) S- e(nu)/cos(rnz). 


Relations (3) and (4’) resemble those proved in Chap. VII, n° 28 for the Jacobi 
function 


6(z) = S> exp (min? z) =14 2(q tgi+q?+q'® 4 ae) : 
where 
q = exp(miz). 
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Here too, the series converges for Im(z) > 0. Applying the Poisson summation for- 
mula and the fact that the function 7 +> exp(—72”) equals its Fourier transform, 
we have already shown that 


6(—1/z) = (z/i)26(z), 


which is the reason why Riemann used it to prove the functional equation of his 
¢(s) series. Obviously, 


O(z + 2) = A(z) 
holds as well. Thus the function @(z)* satisfies (3) and (4’). In fact, 
(4.5) f(2) =0(2)?. 


The proof of this equality is going to take us directly to the theory of modular 
functions in the next n°. 

Like the theta series, f has an easily computable Fourier series expansion 
(Chap. VII, §4, n° 17). Indeed 


{@)=144 7 am = 144 Oe = 
m>1 q m>1 k>0 
(4.6) =14+4 So (-1)*gP9P 2144S > ang” 
m>1,k>0 n>1 


with a series which theorem 7 of of Chap. II, n° 15 tells us converges unconditionally. 
The coefficient a», is the sum of the numbers (—1)* for all ordered pairs (k,m) such 
that k > 0, (2k+1)m = n, But the existence of an integer m such that (2k+1)m =n 
means that 2k + 1 is an odd divisor of n. We infer from this that 


Qn = number of divisors of n of the form 4k + 1 — 


(4.7) —number of divisors of n of the form 4k + 3, 


the number 1(k& = 0), and possible also n, always occurring among the enumerated 
divisors. Easy calculations then show that 


f@)=144(at+ PtP +2 +g +g? + 2g? + 2g 4+...). 


For n = 12 for example, odd divisors are 1 = 1(mod 4) and 3 = 3(mod 4), and so 
Gn = 0. For n = 25, these are 1, 5 and 25, all = 1(mod4), and so a25 = 3. For 
n = 50, the odd divisors are the same, thus again a59 = 3. 

To make conjecture (5) plausible, let us calculate the expansion of 6(z)?. 


Poa a2 
0(z)= oq =14+2S¢q 
Z x>1 
and so 


6(z)? =1+ 4g” +4 >> gh TY 14430 dng”. 


x>1 xjyol n>1 


If n is not a square, by is clearly the number of ordered pairs (x,y) with integers 
x,y > 1 such that «?+y? =n; ifn is a square, 1 needs to be added to this number 
to obtain b,, which amounts to admitting the ordered pair (x, 0) ifn = x”. Hence, 
in all cases, bn is the number of ordered pairs (x, y) such that 
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(4.8) et+y=n, «>1, y>0. 


Again easy calculations then show that the beginning of the expansion is the same 
as that of f. For n = 12, (8) has no solution, whence bj2 = 0. For n = 25 = 5? = 
37 4+ 4? = 4? + 37, we find bes = 3. For n = 50, 


50=P47HP4P H=57 +5", 


whence bso = 3. The reader having access to a computer will be able to check 
further if, like the new cryptographers, he considers it important to replace obsolete 
deductive mathematics of the past by new inductive or experimental mathematics. 
Hence equality f(z) = 6(z). 

Admitting it without proof, we get a classical result of arithmetic: the number 
of solutions of (8) is given by formula (7). 

There are elementary proofs of this result, for example in Hardy & Wright. 
Conversely, (5) can be deduced from the classical result.1° 


5 — The Identity 5° 1/ cos mnz = 6(z)? 


As for proving directly that f(z) = 0(z)*, as stated above, it is a problem in the 
theory of modular functions. In fact, apart from the infinite product expansion of the 
@ that has not yet been proved, everything can be proved “without any knowledge” 
of this theory, or almost without. Some of its features will be presented in § 5. 


(i) The fundamental domain of I'(@). First, the function 


g(2) = f(2)/0(2? 
is clearly invariant under the maps 
(5.1) Tizt+>z4+2, S:z-+-1/z 
from the half-plane 

P = {Im(z) > 0} 


to itself, hence also under every map T?S?T"..., where p,q,7r,... are arbitrary 
rational integers (in fact it suffices to give the exponents of S the values 0 and 1 
since S? = id). These form a group I'(@) of conformal representations 


(2) = (az +6)/(cz +d) 


of P on itself, with coefficients a,b,c,d € Z such that ad — bc = 1, like S and T. 
Multiplication of transformations of this type is similar to that of the corresponding 


15 See Jacques Stern, La science du secret (Paris, Odile Jacob, 1998), Chapter VI. 

16 This is what Freitag and Busam do, in exercises of their chapter VII, §3, Funk- 
tionentheorie, albeit they write f(z) in the less spectacular form that can be 
obtained by permuting the summations with respect to k and m in the first dou- 
ble series of equation (6), which conceals the cosmnz. Their method consists in 
comparing the Mellin transforms of f(z) and 6(z)’, which are quite well-known 
Dirichlet series. We will return to this later. The method used here, directly taken 
from Freitag and Busam (chap. VII, 1.8), is standard in the theory of modular 
functions. I did not find anywhere calculations leading directly to the functional 
equation of f(z) and to the behaviour of f(1 — 1/z) at infinity, but then the 
literature is considerable. 
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matrices‘’ (a blc d). The latter form the modular group. Transformations 7 € I'(@) 
may be showed to be characterized by the additional condition 


a+b+c+d=0 (mod2), 


but it is less obvious and useless. When studying elliptic functions in C with fun- 
damental periods w; and w2 (Chap. II, §3, n° 23), it is not necessary to consider 
them on all of C. It suffices to consider them on the parallelogram P generated by 
these two vectors since any z € C can be transformed into P by adding periods. We 
thus need to imitate this procedure by replacing C and the group of periods by the 
half-plane P and the group I’(@), i.e. to construct a subset F of P, of the simplest 
form possible, such that, for any z € P, there exists a transformation y € I'(0) that 
maps z to a point of F. This amounts to requiring that 


(5.2) P=) 4A: 


ye (9) 


The ideal would be to make sure that the sets y(F) are pairwise disjoint, but this is 
impossible since there may exist y € I'(@) such that y(z) = z, for example S which 
leaves the point 7 fixed. It is, however, possible to choose F in such a way that 
i) F is the closure of some open set, (ii) the intersection of two images y'(F) and 
y""(F) only contains subsets of their boundaries. This is what happens for elliptic 
functions. The next lemma describes such a fundamental domain of the group I’'(6) 
Chap. XI, n° 15, (vii)]. 


Lemma. The set F defined by inequalities 
5.3) 0 < Re(z) < 2, lz} >1, jz—-2|>1. 


satisfies (2). So does the set F defined by 


5.3’) |Re(z)| <1, jz] > 1. 


To see this, we first note that if y = (a b|c d) is a real matrix with determinant 
ad — bc = 1, then 


(5.4) Maen if (<4 <<") _ Im(z) 


2i\cz+d ce+d lez +d?” 


This formula shows that y maps P to P, and since so does the inverse y~! = 
(d —b|—c a), any matrix of this form defines a conformal representation of the 
half-plane P onto itself. 

This being so, for given z € P, let us consider the imaginary parts of the points 
y(z) for y € I'(0). As both c and d are non-trivial integers, the numbers cz + d 
appearing in (4) belong to the lattice generated by 1 and z in C. Hence c and d 
can be so chosen that |cz + d| is minimal, hence that Im[y(z)] is maximal. As the 
translation T : z > z+ 2 leaves Im(z) invariant, multiplying y with a power of T, 
one may even suppose that | Rey(z)| < 1. Since S € I'(6), the image 2’ = y(z) of z 
obtained thereby then satisfies 


|Re (z')| <1, Im (-1/2’) <Im (z') : 


'’ In what follows, the notation (a b|c d) will sometimes denote the 2 x 2 matrix 
whose first row is a b and the second one c d. 
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But Im(—1/z’) = Im(z’)/|z’|?. Therefore, both 
[Re (z')| <1, jz"; >1 


hold, whence (3’). 

We proceed in the same way to reduce to inequalities (3). For given z € P, 
choose 7 such that Im(yz) is maximal. So as above, |cz’ + d| > 1 for any matrix in 
the group. Using z +> z+ 2, transform z’ into the strip bounded by the verticals 
of 0 and 2. Relation Im(—1/z’) < Im(z’) shows that |z’| > 1, inequality |z’—2| > 1 
being obtained in a similar way using T'S : z +> (z — 1)/(z — 2), qed. 

Pursuing these calculations further, the pairwise intersection of the images 7(F) 
could be shown to only contain boundary points, but this is not needed for what 
follows. 


0 1 2 
Fig. 1. 


The above figure represents set (3). Observing that any homography z +> 
(az+b)/(cz+d) with real coefficients and determinant 1 transforms a circle centered 
on the real axis or possibly at the limit of a vertical into a figure of the same type, its 
images under y € I’(0) are easily obtained. The images of (3’) form a tessellation of 
the half-plane P with closed triangles bounded by circular arcs centered on the real 
axis, or with verticals, and accumulating on the real axis. The reader can practise 
drawing the tessellation obtained from the set F defined by (3) or (3’). 


(ii) A general method. Let us consider a function g(z) in P which is holomorphic 
everywhere and invariant under the group I'(@). We intend to search for conditions 
enabling us to affirm that g(z) = 1. The analogous problem in the theory of el- 
liptic functions is easy to solve (Chap. VII, §4, n° 18): if an elliptic function g is 
holomorphic everywhere, the continuous function |g(z)| has a maximum in a period 
parallelogram since it is compact, hence by periodicity in C. The result follows. 

Likewise, in the case at hand, it would suffice to show that |g(z)| has a maximum 
in the closed set F. The latter is not compact but can be made to be so by removing 
all z such that either Im(z) > c for given c > 0, or |z — 1| < € for given € > 0. So, 
to show that g = 1, it suffices to show that g(z) tends to 1 as z € F tends either to 
infinity or 1. Indeed, if this is the case, then, for all r > 0, inequality |g(z) — 1] >r 
gets rid of all points z € F too near 1 or infinity, hence defines a compact set 
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K(r) C F. If g — 1 were not identically zero, then K(r) would be non-empty for 
sufficiently small r and there would be some a € K(r) such that 


lg(z) — 1] S$ g(a) - JJ 


for all z € K(r), hence for all z € F, thus by “periodicity” for all z € P, a 
contradiction. 

So it suffices to analyze the behaviour of g(z) either for large Im(z) or for z € F 
near 1. In the second case, we set 


z=1-1/¢, whence ¢=1/(1-2z). 


The half-circles |z — 1| = 1 and |z — 2| = 1 bounding F are transformed into the 
verticals Re(z) = $ and Re(z) = —3, so that making z tend to 1 on F reduces to 
making ¢ tend to infinity on the strip bounded by these verticals. Thus it follows 
that to prove g(z) = 1, it suffices to show that 


(5.5) lim g(z) = lim g(1—1/z) =1 


as z tends to infinity on the strip 0 < Re(z) < 2 in the former case, on the strip 


—} < Re(z) < 4 in the latter one. Thus it is all the more sufficient to check (5) as 


Im(z) tends to +oo. This is what we will do. 
(iii) The identity f(z)/0(z)* = 1. Let us then come back to the function 


9(z) = f(2)/(z)’, 


invariant under the group I’(@). It is holomorphic on P because the theta series has 
no zeros there thanks to a miraculous infinite product expansion 


A(z) = Il (1—4°") (14 ae ,q = exp(miz) , 


n>1 


a more general form of which will be proved in the next n°. 
To study the behaviour of g as Im z tends to infinity, i-e. as q tends to 0, observe 
that 


O(z)=14+2(qtC+et...), fl) =14+ do ang”. 
n>0 


The first condition (5) thus trivially holds. The second one requires some calcula- 
tions. Summation is over all n € Z in what follows. 


O(z+1)= by exp (win? z + rin) 
since n and n? are either both odd or both even. As our study concerns 6(1 — 


1/z), the Poisson summation formula which we have successfully used for the 6(z) 
series is again required. The Fourier transform of t +> exp(mit?z) being u +> 


(z/i)~2 exp(—aiu?/z), the general formula 


:S p(t +n) = oe £(n) exp(2z7int) 
shows that 


(5.6) ~— exp ( nin? /z 4 2rint) = (z/i)? S| exp [wiz(t + n)?| : 
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For t = 3, this gives 


A(1 — 1/z) = (z/i)? )~ exp 


: Ls Ado n(n+1) 
TZ (n+ 5) | = (z/i)?2q4 Soa = 


qi (1+ @+a°t+...). 


NIR 


= 2(z/i) 
Hence 
(5.7) 6(1—1/z)? ~ 4(z/i)q? as Im(z) —> too. 
If the same result is obtained for f(z) = De 1/cos7nz, this will prove the second 


condition (5), and hence that f(z) = @(z)°. 
To this end, we need to study the behaviour of 


f(l 1/z) = os —— = ye e(n/2) 


(—rn/z) cos(—17n/z) 


as Im(z) tends to +00. The formula 


(4.4”) S51 /cos [r(u+n)z] = (2/i)~* S$ e(nu) /cos(—n/z) 


obtained above is going to lead to the result: putting u = -4 gives 


f(1= 1/2) = (2/A) Y91/ cos [= ¢ 2 5) 7 us 


The latter series converges normally for |q| < r < 1. So it is possible to pass to the 
limit term by term as q tends to 0. The term n = 0 tends to 1 and the others to 0, 
so that the sum of the series tends to 1. Hence 


(5.8) fQ-1/z) ~ 42/892 
in accordance to (7). The result follows. 


Theorem 3. For Im(z) > 0, 


(5.9) s 1/cosmnz = 0(z)?. 


Z 


Other authors prefer writing 


2 buy oA 
n2 
(5.9) dia”) =1445-—7, Id <t. 
Z Et tog 
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6 — The Infinite Product of the Function 0(u, z) 


The previous proof is based on the equality 


(6.1) O(z) = Il (1 q”) (14 ae » qg=exp(miz), 


n>1 


which is necessary (or sufficient — but no other proof is known) to show that 6(z) 
does not have any zeros in the half-plane. The calculations of n° 3 and the arguments 
used above to show that 7 1/cosmnz = 0(z)” enable us to prove (1). 

Once again, let us consider the functions 


nz) =¢/P TT (1-@") , 
P(e) = VT] (1+ 2-2) 


of n° 3. Equality (1) which we need to prove becomes 


6.1’) (2) = n(2)F(2)?. 

As 
mlz +1) = exp(mi/12)n(z) , n(-1/z) = (2/1)? 9(z) 
F (z+ 2) = exp(—7i/12)F(z), F(-1/z) = F(z) 


NIK + 


O(z+2) = A(z), 6(-1/z) = (z/t) 


The ratio 0(z)/n(z)F(z)? is clearly invariant under z +> z+ 2 and z +4 —1/z, 
hence under the group I'(@) of the function 6. The denominator, a convergent 
infinite product, has no zeros in P. Like in the proof of the previous theorem, to 
prove (10’), it will, therefore, be sufficient to show that, as Im z tends to infinity, 


6(z) ? 


(6.2’) A(z) ~ n(z)F(z)*, 
(6.2”) A(1 — 1/z) ~ n(1 — 1/z)F(1 — 1/2). 


The first point is clear: as z € F tends to infinity, i.e. as q tends to 0, 0(z) ~ 1 
and 
nayrg!?,  F(z)~q 


since the infinite product tends to 1. 
To obtain (2”), we first observe that, by (5.7), 


O(1 — 1/z) ~ 2(z/i)qi’*. 
On the other hand, 
n( = 1/2) = exp(mi/12)n(=1/2) = exp(mi/12)(2/8)2 n(2) 
and so 
n(1 — 1/2) ~ exp(mi/12)(2/i)2q'/”. 
Finally, to obtain the behaviour of F'(1 — 1/z), we use the relation 
F(z +1) = exp(—7i/24)G(z), 
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where G(z) = q-\/**]](1 — q?""') is the function of exercise 2 of n° 3. Formula 
(3.20) then shows that 


F(2z4+1)F(1—1/z) = V2exp(—7i/12). 
As F(2z +1) = e~ @#+)74/24 771 — g4"-?) and hence as 


F(2z +1) ~ exp(—mi/24)q_1/, 

it follows that 

F(1— 1/2) ~ Vaexp(—ni/24)ql/? 
and as a result, 

(1 — 1/z)F(1— 1/2)” ~ a1 — 1/2), 
proving (1). 

Exercise. (1) is equivalent to 

(6.3) 6(2)n(2) = exp(wi/12)n[(z +1)/21? . 


As (1) is only a particular case of a much more general formula whose classic 
proof uses far more “elementary” methods, the previous proof is likely to make 
experts scream; but I am not writing for them. Moreover, (1) is going to provide 
us with a very quick proof of the general formula: 


Theorem 4 (Jacobi, 1828). For |q| <1 andweC*, 


(6.4) > g” w™ = II (1—@") (1+¢7""1w) (14+¢7?""*w") . 


n>1 


In what follows, J(q,w) will denote the Jacobi series and A(q,w) the Abel 
infinite product like in Remmert, Funktionentheorie 2, pp. 22-27, which presents 
the classic proof that I will not follow, except for its quasi-trivial part. Mine(?) will 
show the reader getting to grips with these questions that he has not worked in 
vain by following the calculations of n° 3. 

By Weierstrass’ general theorems on normally convergent infinite products 
(Chap. VII, n° 20, theorem 18), for given g, A(q, w) is clearly a holomorphic function 
of w on C*, so has a Laurent series expansion 


A(q,w) = J an(q)w". 
Z 
Replacing w by q?w in the product A(q, w), removes the factor 1+qw and adds 
a factor 1+ q7-'w~' =q-'w7'(1+ qw). So 
(6.5) A(¢a,¢w) =q ‘w'A(q,w), 
i.e. 
Yo an(qhaw” = D7 an(g)q tw"? = D7 anss(q)q tw”. 


We deduce that an4i(q) = ¢?"*t'an(q), then that 


2 


(6.6) an(q) = q” ao(q), 
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first for n > 0, in fact for all n since a_n(q) = an(q) because of the invariance of 
the infinite product under w +> w~'. Hence 


(6.7) A(q,w) = ao(q) J(q, w) 


for all w € C* and |q| <1. 
For w = 1, A(q,w) and J(q,w) reduce to the two sides of identity (1), and so 
are equal. Thus 


A(q, 1) = ao(q)A(q, 1). 


As |q| < 1, none of the factors of the infinite product A(q,1) is zero, so that 
A(q, 1) #0 (Chap. IV, n° 17, theorem 13). Thus ao(q) = 1, qed. 


The 6@(z) series is that value for u = 0 of the function 


(6.8) O(u, z) _ os erin? 2t+2nnu =s 3 q” w 
Z 


Z 


where g = exp(ziz), w = e(u) for Im(u) > 0, wu € C. We will again come across it 
in the theory of elliptic functions. 

As was mentioned in Chap. IV, this history of this formula is somewhat strange. 
Jacobi had discovered before 1828 the significance of the left hand side series, Abel 
having, for his part, used the right hand side infinite product. When Jacobi pub- 
lished his formula, Gauss wrote to him that he was familiar with his result ever 
since 1808, greatly outraging Legendre who accused Gauss of wanting to appropri- 
ate Jacobi’s result. Nonetheless, Gauss (1777-1855) was right as was realized when 
his secret paper were published in 1868. He had became interested in these ques- 
tion even before 1800, used to write down his results every day in his diary and, 
Houzel relates that,!> he had obtained most of Abel and Jacobi’s infinite product 
expansions of elliptic functions as well as many more results of the same type. One 
wonders why he never published them. 

To prove (4), there are methods that do not require algebraic calculations on 
series.!° G. H. Hardy and E. M. Wright classic An Introduction to the Theory of 
Numbers, as well as Remmert, give the simplest proof. Its starting point, replacing 
w by q’?w, is already in Gauss, and as was seen, immediately leads to relation (6). 
We then check through ingenious calculations that 


A(q, 7) =A (q*, -1) ’ J(q, 1) =J (q*,-1) ’ 


from which we deduce that ao(q*) = ao(q). As ao(q) is obviously a power series in 
q, we conclude that ao(q) = ao(0) = A(0, w)/J(0, w) = 1. 


Let us give two striking consequences of (4). 


Corollary 1 (Euler). 
(6.9) Il (l—q")= Se Cieeeetie 


n>1 Z 


Replace q by q?/? and w by —q2 in (2). 


'8 See his article on the history of elliptic functions in Jean Dieudonné, Abrégé 
d’histoire des mathématiques (Hermann, 1978), vol. 2, pp. 34-40) 
19 See for example G. E. Andrews, Proc. AMS, 16, 1965, pp. 333-334. 
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The previous formula can also be written as 


n(z) = So (-1)™ exp [(6m + 1)?riz/12] 


and can be used to prove the functional equation of the 7 function in just a few 
lines; see n° 8. 


Corollary 2 (Jacobi). 
[TT aa")? = oan? = 


n>1 Z 


(6.10) = S\(-1)"(Q2m 4 Ign”? 


Replacing q by q? and w by -qiw in (4) gives 
TewyrgetP? = TT 4") (1 aw) (1g tw) = 
= (-w) [[G=0) 0a") ia"), 


(ey gn? = wt TT 14") (A= aw) (1 a") 


n>1 


As w tends to 1, the right hand side, which converges normally in every annulus 
0<r<|w| < R < +o, tends to product (10). The left hand side tends to the 
derivative of the Laurent series at w = 1, qed. 


7 — The Reciprocity Law for Gauss Sums 


(i) Cauchy’s method. As was seen in the proof of (6.1), the behaviour of the 0(z) 
function in the neighbourhood of z = 1 on the real axis in given by 


(7.1) O(1 — 1/z) © 2(z/i)/2e™*4 , 


as z tends to infinity in the fundamental domain F. More generally, one can work in 
the neighbourhood of a rational point on the real axis and thus prove analytically 
one of Gauss’ most famous discoveries. Cauchy was the first to notice this. 

For this, we set z = 2p/q+it where p and q are non-trivial rational integers and 
where t tends to 0 through positive values or, more generally, so that —1/z tends 
to infinity on F. As —1/z ~ i/t, this means that Re(1/t) tends to +00. 

This said, 


8 (2p/q + it) = }/ exp (2in®p/q — nt) . 


As exp(2miz) = e(z) = e(z + 1), the value of exp(2min?p/q) = e(n?p/q) only 
depends on the class of n mod q. So we also have 


(7.2) 6 (2p/q + it) = a, e (n’p/q) Ss exp [—1(n + mq)’¢] . 


n mod q 
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The Poisson summation formula shows that, for all n and Re(t) > 0, 


De exp [ (n+ mq)’ t] aa> exp ( (—mm? /q?t + 2nimn/q) : 


where the argument of . must be chosen between —7/4 and +7/4. As 


|exp (—1m? /q?t + 2nimn/q)| = exp [—1m? Re(1/t)/q’] ; 


the series obtained converges normally in Re(1/t) > r for all r > 0. It is, therefore, 
possible to pass to the limit term by therm as Re(1/t) increases indefinitely. How- 
ever, except for the term m = 0, all the terms of the series then tend to 0. Thus, 
for given n, 


1 


(7.3) S| exp [- (n+mq)*t] ~ (q?t) ? as Re(1/t) — +00. 


Substituting this result in (2), we get 


B 


(7.4) 0 (2p/q + it) ~ G(p,q) (at) 2, 
where 
(7.5) Gip,d = >> e(n’p/q) 


is the Gauss sum. To exploit the relation 6(—1/z) = (z/i)20(z), let us set z = 
2p/q+it and —1/z = —q/2p + iu. A few calculations show that, as Re(1/t) tends 
to +00, so does 


Re(1/u) = 4Re(1/t)p"/q° , 


with moreover 4p?u ~ q’t. Instead of (2), we write 


0 (—q/2p + iu) = ys exp (—rin?q/2p _ mu) = 


= s exp (—min?q/2p) = exp [—a(n + 2pm)? ul : 
nmod 2p m 

By (3) for g = 2p, the sum over m is equivalent to (4p?u)~2 as Re(1/w) tends to 

+oo. As 4p7u ~ qt, setting 


(7.6) G'(ap)= >_ exp (win?g/2p) = S~e (n7q/4p) , 


nmod 2p 


it, therefore, follows that 


mn 


0 (—q/2p + iu) ~ G'(-q,p) (7 t) 7. 


But the left hand side of the relation found is just 0(—1/z) = (z/i)26 0(z), where 
z = 2p/q+it. Thus, by (4), 


B 


6 (—q/2p + iu) ~ (t + 2p/iq)? G G(p,q) (qt)? 
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as well. As t tends to 0 from the right of the imaginary axis, z/i tends to 2p/iq. As 
(z/i)2 is defined by choosing Arg(z/i) between —7/2 and +7/2 for Im(z) > 0, we 
have to choose 


Arg(2p/iq) = —1/2 if p/g>0, =+4+7/2 if p/q<0. 
Hence 
(2p/iq)? = |2p/ql2 exp [—7. sen(pq) /4] = 
(7.7) = |p/q|? [1 — sgn(paq)il - 


Comparing the two asymptotic evaluations found for 6(—q/2p + iu), we finally get 
the equality 


(7.8) Ip|- 2G" (—a,p) = lal”? G(p, @) [1 — sen(pq)i] 


In particular, this result applies for p = 1. Then G’(—q, p) = 1+exp(—7igq/2) = 
1+% 7, and so 


(7.9) ll ?G(,q) = (1+4-*) / [1 —sen(q)¢] 
and (Gauss) 


1+% if q=0 mod4 


“a —1l4i7 1 if q=1 mod4 
(7.9’) q ?G(1,q) = T¢i SiO if q=2 mod4 Ge) 
i if q=3 mod4. 


In the general case to which we now return, sum (6) can be calculated differently. 
For this observe that, if p is odd and 4 1, then each class mod 2p is obtained once 
and only once by setting n = 2u + pu where wu varies modp and vmod 2, which 
transforms the general term of (6) into e(u?q/p)e(v2pq/4). The sum over umod p 
equals G(q, p) and the sum over v mod 2 equals 1 + i??. Hence 


(7.10) G'(q,p) = G(a.p) (1+ 2%) , 
a result which, for p and q odd, can also be written 
(7.10’) G'(q,p) = G(q,p) [1 + €(pa)i 
by setting 
e(n)=1 ifn=l1lmod4, =-1 if n=3mod4. 


By the way, notice the useful relations 
n = e(n)mod4, e(—n) = —e(n), 
e(mn) =e(m)e(n), 144" = 22 [1 +e(n)i] . 
Comparing (8) and (10) and observing that 
G(—4,p) = Ga, P) 


it follows that, for p and q odd, 
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i _1 ; 2 
(7.11) Ip|”? Ga, p) = lal? G(p, g) [1 — sen(pq)¢] / [1 — €(pq)i] - 
Before deducing the arithmetic consequences of point (iii), we give a very different 


method to obtain these formulas. 


(ii) The Dirichlet method. These formulas have many proofs, including of course 
arithmetic ones. Dirichlet had the idea of applying his theorem on Fourier series 
(Chap. VII, §3, n° 11, Theorem 7 bis) to the function with period 1 on R given by 


(7.12) f= So el(t+k)p/q] for 0<t<1. 


0<k<q-1 


As t € ]0,1[ tends to 0 or 1, its limit values equal G(p, q), and since it has right and 
left derivatives everywhere, 


(7.13) G(p,q) = >_ f(n), 


the sum of the series being the limit of its partial symmetric sums. We are going to 
deduce relation (8), which is not essential in the proof of reciprocity formula (11). 

To make the clever calculations of the general case more understandable, let us 
first suppose p = q = 1, so that G(p,q) = 1. Here, 


fn) = | e (t? + nt) ar =e(-n?/a) | e [(t + n/2)”] dt = 


(n+2)/2 
Sea?) if e (12) dt. 
n/2 


So, by Dirichlet’s theorem on Fourier series, 


(n+2)/2 5 
1=S e(-n /4) e(t”) dt. 
% n/2 
Setting n = 2m or 2m + 1 according to the case, this becomes 
m+1 , 5tm41 5 
l= - 
By e (t?) dt ae) e (t?) dt 
Z vm Zo“ gm 
Setting t? = u reduces to proof to integrating the functions |u|~ 2 cos(27u) 


and Jul ~2 sin(27u) over R, which leads to (not absolutely) convergent integrals 
(Chap. V, §7, n° 24, Theorem 23), thereby giving the formula 


(7.14) [ow (2rit”) dt = 1/(1-4) = (1+%)/2 


which is going to prove useful later. The reader will deduce the Fresnel integrals 
J cos(2rt*)dt and f sin(27t)dt extended to Ry. 

Exercise 1. Recover (14) by integrating the function exp(27iz) along the closed 
contour consisting of the interval [0, R] of R, of the arc 0 < Arg(z) < 1/4 of the 
circle centered at 0 of radius R, and of the line segment connecting the endpoint of 
this arc to the origin. Jordan’s lemma (Chap. VIII, n° 8, (iv)) will be useful. 
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Exercise 2. Recover (14) starting from the integral of exp(wit?z), Im(z) > 0, 
and passing to the limit. Show that 


B 


[fo exp (mit *z) dt = (z/i)? [ro exp (—mit ? /z) dt 
for all f € S(R) and Im(z) > 0. Deduce that the Fourier transform of the distribu- 


tion? exp(mit?)dt is e~7*/+ exp(—mit?) dt. 
Let us now consider the general case. 


f(—n) = Sy, e [na + (a+ k)’p/q| dx =[ e (nx + 2*p/q) dx 


since k varies from 0 to gq — 1. Second degree trinomial theory then shows that 


m ‘ q+nq/2p 9 
f(-n) =e(-n alt») f e (2° p/q) dx 
nq/2p 
As the exponential only depends on the class of n mod 2p, 
(m+1)q+nq/2p 
G(p,q)= >> e(=n*q/4p) ay e (x"p/q) de = 
n mod 2p qtnq/2p 
= > e (—nq/4p) | e (x?p/q) dx 
nmod 2p R 
So, setting t = p/q, 
(7.15) Glp.4) =G'(-a.p) f (te )ae, 
R 


where G’(q,p) is sum (6). By (14), the integral equals Ip/al~2/[L — sgn(pq)i] and 
we recover relation (8) obtained by Cauchy’s method. 


(iii) The quadratic reciprocity law.2! If p is a prime number, then the ring 
F, = Z/pZ of integers mod p is a field, the multiplicative group G(p) then being 
the set of its non-trivial elements. As it is commutative, its image under the map 
x —> x” is a subgroup G (p)+ whose number of elements is equal to the quotient of 
Card[G(p)] = |p| — 1 by the number of solutions of 2” = 1, namely 2 if p is odd as 
will be assumed. Integers that are not divisible by p fall into two disjoint sets: the 
set G(p)* of quadratic remainders mod p, and the set G(p)~ of non-squares. Then, 
for n € Z non-multiple of p, we set 


7 a +1 if ne G(p)t 
(7.16) (2) a (4) = 0 if n=0 modp 
P \p| -1 if née G(p)~ 


20 See Chap. VII, n° 32. Generally speaking, (t)dt denotes the distribution f —> 


J F(t)p(t)at 


! This section assumes that the reader is familiar with the definitions of n° 9. 
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Since there are only two cosets modulo G(p)t in the group G(p), the Legendre 
symbol that has just been defined clearly satisfies 


an (F)-G)G) 
Pp Pp Pp 
for all m,n € Z. Hence it is a character mod p in the sense of n° 9, (iii). 
G(p)* and G(p )~ having the same number of elements, the cardinality of G(p)t 


is (|p| — 1)/2. So™ 


(7.18) nilPI-D/?2 41 for all n € G(p)* 


as well as 
nl =nmodp forall ne Z. 


This is “Fermat’s little theorem” used by cryptographers. 

In fact, relation (18) characterizes squares since, being an algebraic equation of 
degree (|p| — 1)/2, it cannot have more that (|p| — 1)/2 roots in the field Fy. 

(18) enables us to determine whether —1 is or is not a quadratic remainder 
mod p. The first case occurs if and only if (—1)“?!-)/? = 1 in Z/pZ, hence in fact 
in Z since p is odd. In other words, 


(7.19) (=) 24 2 (is) =e) een) 


This result will prove useful. Recall that e(n) = 1 or —1 according to whether odd 
n equals = 1 or 3mod 4, so that n = e(n) mod 4. 

Let us now return to the Gauss sum G(p,q) by assuming that p and q are odd 
primes. Setting e(z) = exp(27iz), 


(7.20) G(p,q)= S> e(n’p/q) =1+2 S> e(mp/a) 


nmod q G(q)t 


since any quadratic remainder can be written in two ways as n”. We then distinguish 
between the two cases. 

If p € G(q)*, the map m +> mp permutes the elements of the group G(q)* 
Hence 


(7.21) G(p,q) =1+2 S > e(m/q). 


G(q)t 
This result is independent of p, and so is equal to its value for p = 1. Thus 


(7.22) G(p,q) = G(1,q)_ if pe G(q)t 


22 Tf G is finite group and if x € G, the powers x” are not pairwise distinct. Thus 
there is some n > 0 such that 2” = e, the unit element, and the smallest n with 
this property is obvious the cardinality of the subgroup G(x) generated by x. As 
G is the union of cosets G(x)y containing the same number of elements as G(z), 
the cardinality of G is a multiple of the cardinality of G(x). Thus c©""(@ = e 
for alla € G. 
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If, however, p € G(q)~, then m+—> mp is a bijection from G(q)* onto G(p)~ 
and 


G(p,q) =1+2 S© e(m/q) = 
G(q)- 
(7.23) =14+25 > e(m/q)-2 S> e(m/q). 
G(q) G(q)t 


The sum over G(q) would be zero if m varied over all of Z/pZ (sum of the roots of 
unity qg°); as summation is over all m 4 0, it equals —1. Taking (21) and (22) into 
account leads to 

(7.24) G(p,q) =—-G(1,q) if pEeG@)” 


Both cases are unified by 


(7.25) G(p,q) = (2) G(1,q). 


Exercise. Set 


Show that 


G(p,q)= Y> x(n)e(np/q) = G(x, 4) 


n mod q 


in the notation of n° 9, (iv) and deduce (24). 
Let us now come back to relation (11). For n odd, i” = e(n)i. Thus (11) now 


becomes 
w* (4) Ga = la} (2) ca, 
(710) pl 2a) = lal 2G, 4) [1 — sen(pq)i] / [1 — e(pa)i 
But, we saw above that for an odd prime q, 
jal 2G(1,q) = (144-9) / [1 — sen(q)e] = [1 — eCa)d] / [1 — sen(q)al 


and that there is also a similar relation for p. It then follows that 


(2) (2) _ _1~e(q)i_1+sgn(p)i 1 — sgn(pq)i_ 


(7.25) aq) \p 1—sgn(q)i 1t+e(p)i 1—€(pq)i 


The right hand side being equal to +1 or —1, it suffices to calculate its argument 
mod2n. For a = +1 or —1, the argument of 1+ ai equals a7/4. Mod 2z, the 
argument of the right hand side, therefore, equals ka /4, where 


k = —e(q) + sgn(p) — sgn(pq) + sgn(q) — €(p) + €(pq) = 
= [e(p) — 1] [e(q) — 1] — [sgn(p) — 1] [sgn(q) — 1] . 
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Since e(p) = pmod 4, we finally get the quadratic reciprocity law 


(6) 9)-crtem 


which holds for positive odd primes p and q or only when they are not both negative. 
The result changes sign otherwise. There are easy generalizations where p and q are 
no longer assumed to be prime. They can be found in all number theory textbooks. 

Legendre guessed the formula and Gauss attained instant fame by proving it. 
Finding generalizations, for example for rings of algebraic integers, or other proofs 
was a national sport for the German dynasty created by Gauss until the rest of the 
world, from Japanese Takagi in 1920 to Chevalley some ten years later, discovered 
the topic, and after 1935, expanded it. Governed by a High Commissioner rigor- 
ously monitoring the alignment of the Great Pyramids, it is today one of the most 
respected domains in mathematics. 
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8 — The Functional Equation of n(z): bis 


Consider once again the formula 


n(z) = exp(miz/12) > (1— 9°") = $0 (-1)™ exp [(6m + 1)?riz/12] = 


n>1 Z 


(8.1) = SCexp [(12m + 1)?riz/12] — S© exp [(12m + 7)? riz/12] 


which follows from the Jacobi identity (Corollary 1). Let y(n) denote the function 
whose values, depending only on the class of n mod 12, are given by the following 
table: 


n 012 3 4 
sae: OO OS OS OO OC 


Oo 
or) 
xl 
oe) 
Ko) 
= 
j=) 
a 


Clearly, x(n) 4 0 if and only if n is coprime to 12 and 


(8.2) x(mn) =x(m)x(nr),_ x(—n) = x(n) 


for all m,n € Z. With this notation, (1) can also be written 


(8.3) 5 5 xn) n) exp [in els 


If indeed y(n) = 1, then either n = 12m +1, or n = 12m+11 = —(12m’ +1) with 

m’' = —m—1. Because of the 3 factor, the contribution of these values of n to series 

(3) is equal to the first series (1). If, however, x(n) = —1, then either n = 12m+7, 

or n = 12m+5 = —(12m’ + 7) with m’ = —m — 1. Hence the second series (1). 
That being so, let us write 


2 xtn x(n) exp [in ef 12\'= xc x(a) S| exp [wi(a + 12n)*z/12] 


amod12 Z 
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and apply the Poisson summation formula to each subseries. After a few lines of 
calculations, we find 


Ss exp [mi(a + 12n)*z/12] = (12z/i)~ => exp | [-min? /12z + rian/6] . 
In consequence, 
(122/i)2 n(z = So x @) peel [-min?/12z + rian/6] é 
amod12 


As the factor exp(zian/6) = e(an/12) only depends on the class of amod 12, the 
result can also be written as 


(8.4) (12z/i)~ 2 =S> a x.) exp (—7in 7/122) 


setting 


(8.5) G(xy,n) = s- x(a)e(an/12) = So x(a x(a) exp(zian/6) , 


where summation is extended to the four classes a = 1,5,7 and 11mod12. The 
result only depends on the class of n mod 12 since e(z + 1) = e(z). 

Setting w = exp(zi/6), which implies w® = —1 and w’* = 1, and taking into 
account the values of x(n), we get 


G(x, n) _ w” _ we” = we” ae wil? a: w™ =f: w™ ay (i'r (w™ zi wis) _ 
=2[1+ (-1)"""] cos7n/6. 
Like x(n), the result is trivial if n is even and, if n = 2m +1, it equals 4cosn7/6. 


As cos 1/6 =cos 1117/6 = V3/2, cos 37/6 =cos 97/6 =0 and cos57/6 = cos 7x/6 = 
—V/3/2, it finally follows that 


(8.6) G(x,n) = V12x(n) 


for all n. 
Hence relation (4) can now be written as 


n(z) = (12z/t)~ 2122 S> x(n) n) exp [ [—min? /122] = (z/i)~ 2 n(-1/2), 


where the summation is over Z, thereby leading to a functional equation of the 
Dedekind function by a much simpler method than Weil’s; but it assumes the Jacobi 
identity. 

By the way, note that, by (3), the Mellin transform of t -> n(it?) — 1 is the 
series 


3 xn ) [ exp —nn?t? /12) t°d*t = (m/12)°? (8/2) > x(n) /n’ 


n>1 


The relation n(it~?) = tn(it?) immediately gives a functional equation for it in 
st >l1—s. 
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9 — Arithmetic Interlude 


(i) Quotient rings. A ring (commutative in what follows) with unit element is a set 
A with two operations (x,y) +> «+ y and (x,y) +> zy satisfying all algebraic 
rules, including the existence of elements 0 and 1 that had better be assumed to 
be distinct short of a fascination for properties of rings with no elements or only 
one. So called elementary arithmetics is the study of the ring Z. A field is a ring 
in which every x # 0 has an inverse and where 1 ¥ 0, so that a field always has 
at least two elements, the set {0,1} equipped with the obvious rules is effectively a 
field with two elements. 

If A’ and A” are two rings, a ring structure can be defined on the Cartesian 
product A’ x A” by setting 


(ere) A (y',y") =e (a +2", y/ +y") : (2°) : (y',y") = (ee ae) : 


Identifying every x’ € A’ with the ordered pair (x’,0) and every x” € A” with the 
the ordered pair (0,2), then gives (x, 2’) = 2’ + 2”, but the product 


eo (a 0) (Ore) = (0,0) 0 
in A’ x A” is not equal to 
Gey — (a, 1) (1, 2”) ; 
An ideal of a ring A is an additive subgroup I of A such that 
ctEA & yEelT=rzyel. 


For example, the set «A of multiples xy of x € A. These are the only ones in Z, for 
if, in a non-trivial ideal J of Z, d denotes its smallest element > 0 (such an element 
exists since « € J =» —a € I) and if every x € Z is written as 


zx=dq+r withqeEZ, O<r<d, 


the remainder r = x — dq of the Euclidean division of x by d, which is in J if x € J, 
is trivial as it is < d; whence I = dZ. Clearly, 


dZ Dd aZ => dla, 


where we use conventional notation to say that d divides a. 
This result shows that, for all a,b € Z, 


aZNbZ=mZ, 


where m is the lcm of a and 6b: the common multiples of a and 6 are precisely the 
multiples of m. 
As the sum aZ + bZ is also an ideal of Z, likewise 


aZ + bZ = dZ 


for some number d = au + bv. As dZ contains aZ and bZ, d is a common divisor of 
a and b. Any other common divisor divides au+ bu, hence divides d. In other words 
d is the gcd of a and 6, and is denoted d = (a, b). It is easily seen that ab = md. If 
(a,b) = 1, a and b are said to be coprime. Equivalently: all x € Z is of the form 


(9.1) x =au-+t bu (Bezout’s identity) , 
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whereas, in the general case, this property assumes that x is a multiple of (a,b). In 
all cases, 


a=da', b=db' with (a’,b')=1. 


If J is an ideal of a ring A, the relation x — y € J is written x = ymodJ, The 
coset modI of x € Ais the set r+] of x+y where y € J. For x,y € A, it is clear 
that the cosets mod/ of x + y and zy only depend on the cosets of x and ymod I. 
This defines an addition and a multiplication in the set A/I of cosets mod I, which 
enables us to talk of the quotient ring A/I. 

The coset of a € A is invertible in A/J if and only if there exists u € A such 
that au = 1modJ. Then x = auxmod/ for all x € A. Hence A = aA-+ IJ, and 
conversely. In the case A = Z, I = bZ, this means that there are integers u and v 
such that au + bv = 1, in other words that (a,b) = 1. Like in any ring, the set of 
invertible elements of A/I is a multiplicative group (A/J)*. The groups 


G(m) = (Z/mZ)* 


play a particularly important role in arithmetics. 

A/T is a field if and only if A= aA+TI for all a not belonging to J. This means 
that the only ideals J containing I are J and A since J D> aA+T for alla € J. I is 
then said to be a maximal ideal of A. In the case A = Z, I = pZ, the ideals J = dZ 
containing J correspond to the divisors of p. So the ideal pZ is maximal if and only 
if p is a prime. Equivalently: if p divides xy, it divides x or y. Indeed this relation 
means that, in the quotient ring Z/pZ, the relation xy = 0 implies x = 0 or y = 0, 
in other words that Z/pZ is an integral ring or that, for all « # 0 in Z/pZ, the map 
y'—> zy is injective. As Z/pZ is a finite set, this map is bijective and z is invertible 
in Z/pZ. The quotients F, = Z/pZ, p = 2,3,5,7,11,... are the simplest examples 
of finite fields. 

All integers are products of prime factors. This is written as 


n= e]]p”™ 


with a factor « € {—1,1} and exponents vp(n) € N all of which except a finite 
number are trivial. For the gcd (resp. lcm) of two integers m and n, the exponent 
p is the smallest (resp. the largest) of all integers vp(m) and vp(n). So any prime 
factor of the [cm is a prime factor of m or of n (or both). 

If J and J are ideals of a commutative ring A and if I D> J, relation x = ymodI 
implies x = ymodJ. Hence the coset mod J of x € A only depends on its coset 
mod I. This gives a “canonical” map 


A/I — A/J 
which is a ring homomorphism. If J, J, K are ideals such that J Cc J C K, the map 
A/I —> A/K is clearly composed of maps A/I —> A/J and A/J — A/K. 


If m= m'm" in Z, we get a canonical ring homomorphism 
? 


(9.2) Z/m'm"Z — Z/m'Z x Z/m"Z 
by associating to the class mod m of every x € Z its classes mod m’ and m”. 


Lemma 1. Homomorphism (2) is injective if and only if (m',m") =1; it is then 
bijective. 
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The injectivity of (2) means that any common multiple of m’ and m” is a 
multiple of m’m”, hence that m'm” is the lem of m’ and m”. As this Icm is always 
equal to m’m"/(m',m"’), condition (m’,m") = 1 follows. Conversely if the latter 
holds, the surjectivity of (2) remains to be proved, in other words that, for all 
a’,a” € Z, there exists « € Z such that both « = a’ modm’ and x = a” modm” 
hold. However, by Bezout’s theorem, there are integers u’, uv” such that 


! " uo noe 
a-a =muU—-mu. 


The number x = a’ + m’u’ =a” + mu" 


then answers the question. 


(ii) The groups G(m) ; characters mod m. For any integer m > 2, as was men- 
tioned above, we set 


G(m) = (Z/mZ)" . 


This is the multiplicative group of classes of coprimes to m, for example the classes 
of 1,5,7 and 11 if m = 12. The canonical map A/IJ —> A/J defined in the general 
case for the ideals J and J D I induces a homomorphism from the multiplicative 
group (A/J)* to the multiplicative group (A/J)*. For A = Z, there is a more precise 
result: 


Lemma 2. Let m and m’ be two integers such that m'|m. The canonical homo- 
morphism G(m) —>+ G(m’) is surjective. 


The proof reduces to showing that any a’ such that (m’,a’) = 1 is the class 
modm’ of some a such that (m,a) = 1, i.e. that there exists x such that (m,a’ + 
xm’) = 1. This means that no prime divisor p of m should divide a’ + xm’. If 
p divides m’, it divides xm’ but not a’ since (a’,m’) = 1. For these primes, the 
condition thus holds for all x. If, on the other hand p does not divide m’, there are 
two possible cases. If p divides a’, any x not divisible by p is suitable. If p does not 
divide a’, any x divisible by p is suitable. So the proof reduces to including in the 
list of prime factors of x all the prime factors of m dividing a’ (they do not divide 
m') and to excluding from it those that divide a’, which is possible since these sets 
of primes are disjoint, qed. 

Let us suppose that m = m’m". Homomorphism (2), namely 2 +> (2’, x”), 
defines a homomorphism from G(m) to the multiplicative group A* of invertible 
elements of the ring Z/m’Z x Z/m"Z = A. The unit element of A being (1, 1), its 
invertible elements are the ordered pairs (2’,2”) with x’ € G(m’), x” € G(m”). 
If, for all  € Z/mZ, x’ and x” denote the images of x in Z/m’Z and Z/m"Z, 
homomorphism (2) transforms x into (x’,x’) € A and the map 


(9.3) G(m) — G(m') x G(m") 


thus obtained is obviously a homomorphism from the group G(m) to the Carte- 
sian product of the groups G(m’) and G(m”). As the multiplicative groups of two 
isomorphic rings are isomorphic, lemma 2 shows that (3) is an isomorphism if and 
only if (m',m") = 1. As (a’, x”) = (a’,1)(1,2”) in the Cartesian product A, the 
result can also be formulated as follows: 


Lemma 3. Let m’ and m” be two coprimes and m = m'm”. Let G'(m) (resp. 
G"(m)) be the subgroup of x =1(modm”) (resp. (modm’)) in G(m). Then every 
x € G(m) is uniquely expressible as the product of an element of G'(m) and of an 
element of G(m) and these subgroups are isomorphic to G(m') and G(m"). 
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This enables us to calculate the number y(m) of elements of G(m), known as 
the Euler indicator of m. To start with, it is clear that y(m'’m’’) = y(m’)y(m”) if 
(m',m") = 1. Obviously, for a prime m = p, 


y(p) =p-—1=m(1—-1/p). 


If m = p’ with p prime, G(m) decomposes into classes of numbers not divisible 
by p. There are p” — p’—' of them, and so once again y(m) = m(1 — 1/p). In the 
general case, m has a prime factor decomposition and we get a product 


(9.4) y(m) =m] J -1/p) 


extended over all prime divisors of m. 

If, instead of only considering G(m), one also consider Z/mZ, the elements x 
can be classified according to the value of d = (x,m), For any divisor d of m, 
these x are the classes mod m of the numbers dy with (y,m/d) = 1. As the relation 
dy = 0(mod m) is equivalent to y = 0(mod m/d), it follows that the number of such 
classes is equal to y(m/d). Hence the formula 


(9.5) ae eee Or 
d|m 


d|m 
For an integer m 4 0,1,—1, the character mod m is said to be any function 
x: G(m) — C 
satisfying 
9.6’) x(zy) = x(z)x(y), x(1)=1 
Let us extend the definition by setting 
9.6”) x(x) =0 if (a,m) Al 


so that relation (6’) continues to hold for all x and y. 


9.7) lx(n)| =1. if (m,n) =1 


necessarily holds since the image of G(m) under x is a finite subgroup of C*. Its 
elements are roots of unity since the powers of an element of a finite group are not 
pairwise distinct. The simplest character is « +— 1; it is called the unit character. 
If x’ and y” are two characters, so is their product «++ y'(x)y""(2). 

If m’ is a divisor of m and if 7 : G(m) —+ G(m’) is the canonical map, every 
character y’ mod m’ defines a character 


x=x' on de G(m). 
Do not forgot to set x(x) = 0 for (a,m) 4 1. It is then clear that?* 
(9.8) (x,m)=1 & wx =1modm’ = x(x) =1. 


Conversely, let us suppose that this condition holds. As the homomorphism 7 : 
G(m) —+ G(m’) is surjective and as (8) means that (xz) = 1 for x such that 
a(x) = 1, relation (6’) shows that 2(x) = m(y) implies x(x) = x(y), so that x is 


?3 Be careful with the condition (x,m) = 1. A coprime x to m’ is not necessarily 
coprime to m. Equality x = x’ 0 7 no longer holds in G(m). 
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given by the composition of 7 and a well-determined character y’ of G(m’). x will 
then be said to arise from a character mod m/’ or, simply, by abuse of language, that 
y is a character mod m’. If m’|m"|m, then clearly y also arises from a character 
xy" modm”, namely that given by the composition of x’ and of the canonical map 
from G(m’’) onto G(m’). 

A character mod m is said to be primitive if it can be obtained in this way from 
a non-trivial divisor of m. This means that, for divisors m’ of m, it never reduces 
to 1 on the kernel of the homomorphism G(m) —> G(m’), in other words that, for 
every divisor m’ 4 m de m, there exists x satisfying 


(9.9) (x,m)=1, x=1modm’, x(x) 41. 


In the notation of lemma 3, this means that, for non-trivial decompositions m = 
mm", x never reduces to the unit on G’(m). 


Lemma 4. Let x be a character modm and _m’',m" divisors of m such that y is 
both a character modm’ and modm”. Let d be the gcd of m’ and m". Then x is a 
character mod d. 


Let pz be the lem of m’ and m”; it divides m, so that x follows from the 
composition either of maps 


G(m) — G(n) > G(m') > x’, 


or of analogous maps for m” and x’, where y’ and y” are the characters of G(m’) 
and G(m”’). As x cannot arise from two different characters of G(s) G(m) —> G(p) 
being surjective, it is enough to consider G(j,z), in other words to suppose that m 
is the lem of m’ and m”, whence m = m'm" /d. 

By (9), the proof then reduces to showing that 


(c,m)=1 & w=1modd=> x(x)=1. 


But, by Bezout, 2 = 1modd means there are u’ and wu” such that « = 1+ m/u! + 
mu". (m,1+m'u') = 1 for if a prime p divides m and 1+m/’u’, it could not divide 
m’ or it would divide 1, thus also m” since m is the lem of m’ and m”, hence would 
divide 1+ m'u’ +m"u" = x, which is impossible as (x,m) = 1. This being so, since 
x and 1+m/’u’ are coprime to m and equal modm”, x(x) = x(1 + m’u’) by (8) 
applied to m”. But as 1+ m’u’ is coprime to m and = 1modm’, y(1+m’u’) =1 
by (8) applied to m’, ged. 

This implies that, if d is the ged of all divisors m’ of m such that y is a character 
mod m’, then x arises from an obviously primitive character mod d. The numbers m’ 
considered are then clearly all the possible multiples of d dividing m. The number 
d is called the conductor of x; it is the unique divisor d of m for which x arises 


from a primitive character mod d. 


(iii) Orthogonality relations. The definition of characters modm can be gen- 
eralized to all groups G: a character of G is a homomorphism y from G to the 
multiplicative group C*. The simplest case is that of a finite commutative group G. 
To address it, one could invoke the general theory of Chap. XI, n° 26 and 27, but 
it is much simpler to argue directly. 

Let us consider the (finite-dimensional) Hilbert space L?(G), i.e. the vector 
space of functions f : G —> C equipped with the inner product 


(9.10) (f\g) = Card(G)~" > f(x) g(a). 
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In L?(G), the translation operator Ua, transforming the function f(a) into the 
function Uaf(x) = f(ax) for all f € L°(G), can be associated to all a € G. Trivial 
calculations show that 


(Uaf,Uag) = (f,9) 
for all a, f and g, as well as 
UU = Ua 


for all a,b € G. The former relation says that all U, are unitary and hence di- 
agonalizable. Since G is commutative, the latter one implies that they they can 
be simultaneously diagonalized, more precisely that there is an orthogonal basis 
of L?(G) whose elements are the eigenvectors of all Ua. If x € L?(G) is such an 
element, relation Uax = A(a)x with A(a) € C means that x(ax) = A(a)x(x) for 
all a and x. Thus y(a) = x(e)A(a) and so x(a)x(xz) = x(e)x(ax). As x is not the 
trivial function, one can always assume that x(e) = 1, in which case it follows that 


x(xy) = x(@)x(y)- 
The image of G under y being a finite group in C*, |y(a)| = 1 for all x and so 


(xlx) =1 


since out of precaution, we introduced a factor Card(G)~' in the definition of 
the inner product. Finally, there is an orthonormal basis of L?(G) composed of 
n = Card(G) characters of G. 

Besides, the vectors of the basis x;,1 < i < Card(G) that have been found are 
the only characters of G. Indeed, almost by definition, a character y’ is a common 
eigenvector of all Ua. If x’ 4 xi for some i, there exists x such that y’(x) 4 xi(z), 
so that y’ and y; are eigenvectors for U, with distinct eigenvalues . Hence they 
are orthogonal. As a non-trivial element of L?(G) cannot be orthogonal to all basis 
vectors, the conclusion follows. 

The equalities 


1. X= xX 
0 if x#x’ 


are the orthogonality relations for characters. 
Exercise 1. For every function f € L?(G), set 


fod = (flx) - 
Without using §7 du Chap. XI nor the Lebesgue integral prove that 


f(x) = Card(G@)~" 7 FO) x(a) 


(9.11) (xlx’) = 


for all x € G and that 
(fg) = Cara(@)* > F00900 = (1a) 


for all f,g. Analogy with Fourier series? 

Exercise 2. The product of two characters of G being also a character of G, a 
multiplication can be defined on the set G of characters of G. It transforms G into 
a commutative group, the dual of G. Show that every character of G is of the form 
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&:x ++ x(x) for some unique x € G. Deduce that x +> @ is an isomorphism 
from G onto G. Write down the orthogonality relations for the characters of Gc 
Exercise 3. Let H be a subgroup of G and x a character of H. Let L?(G; x) be 


the space of functions f on G such that f(xh) = f(x)x(h) for all h € P, equipped 
with the inner product 


(flg) = Card(G/H)* S* f(a) g(a) 


where summation is over all cosets mod H since the function being “integrated” is 
constant on these classes. Show that the characters of G equal to x on H form an 
orthogonal basis for L?(G; x). Deduce that the restriction homomorphism G—H 
is surjective. Generalize to all locally compact commutative groups. 


(iv) Gauss sums. One can construct a theta series 


(9.12) =~ x(n) f (na/m?) = x(-1)0;(—2; x) 


from a character ymodm and a function f € S(IR) as was done in n° 2 to prove 
the functional equation of the ¢ function or, less trivially, in n° 8 to prove that 
of the 7 function. As x(—n) = x(—1)x(n), the formula only involves the function 
4[ f(x) +x(-1)f(—2)]; so it is reasonable to confine ourselves functions f for which 


f(—«x) = x(—1)f(x). The reason for the factor m2 will become clear later. In the 
meantime to make calculations easier, we set 


g(x) = f(x/m?) , whence g(2) = m2 f (am?) : 


Grouping together the terms of series (19) belonging to a same class modm 


shows that 
O¢ (x3 x) = oe a) Sg a+pm)x 


Z/mZ pez 


The Poisson summation formula can then be applied to the function t + g[(a + 
tm)a]. Three lines of calculations lead to 


O¢(a;x) = (ma)~* S$) G(n/me)x(a)e(an/m) = 


a,nmodm 
(9.13) = (mz) a pen n,x)g(n/mz) 
with Gauss sums 


(9.14) Im(n,x) = S> x(a)e(an/m) 


amodm 


that most authors simply denote by G(n, x). The function e(z) = exp(27iz) has 
period 1, which justifies the summation modm, which in fact is extended to all 
a € G(m) for otherwise y(a) = 0. We also set 


(9.15) Im(x) = =S> x(a) e(a/m). 
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The functional equation of the series 6(x; x) being primarily based on properties 
of Gauss sums, in this section we will prove those that will be essential for us, and 
even a bit more. The main result is point (a1) of the next theorem: 


Theorem 5. Let x be a character modm and n an integer. Set 
(9.16) d=(m,n), m=dm', n=dn’'. 


(a) In the three following cases 


(9.17) In(n, x) = Pn) x(n) : 


(al) for all n if x is primitive ; 
(a2) for (m,n) =1 for all xy; 
(a3) x is not a character modm’. 


(b) If x arises from a character x’ modm’, then 


(9.17’) Im(n, x) = a Pint (x')x'(n') . 


(14) shows that first of all 


Im(n, x) x(y) = yi x(xy)e(an/m) for all ye Z. 


y may be supposed?‘ to be in G (m) for otherwise the previous relation would reduce 
to 0 = 0. Let z be the inverse of y in G(m). The map x+—> xy = 2’ permutes the 
elements of G(m), and as x = x'z, this change of variable in the Gauss sum shows 
that 


Tin(n, x)x() = D> x(2’)e (2'2zn/m) = Iin(zn, x). 


Several cases that need to be considered 
If n is coprime to m, one may choose y = n, in which case zn = 1modm. Then 


x(y) = x(n), and so since |x(n)| = 1, 
(9.18) Pin(n, x) = Ein(x)x(n) if (n,m) = 1, 


which proves proposition (a2) in the statement. For example, in the extreme case 
where x is a unit character, 


Im(n,1) = »y e(an/m) 


aeG(m) 


has the same value for all exponents n coprime to m. 

If n is not coprime to m, use (16), which implies (m’,n’) = 1. As e(an/m) = 
e(an’/m’) only depends on the class of amodm’, the terms of G(m) belonging to 
the same class mod m’ can be grouped together in (14), before summing over all 
classes mod m’. Hence 


(9.19) Im(n,x) = > e (xn'/m’) Da x(a). 


x modm/ a=x mod m’ 


?4 Tn this type of calculations, it is convenient not to make any difference between 
an integer and its class modm; the context always indicates the correct inter- 
pretation. 
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Since x(a) # 0 assumes (m,a) = 1, the partial sum over all a is extended to all 
a € G(m) with the same image as x in G(m’). These are xh for h in the kernel H 
of the homomorphism G(m) —>+ Gm’), whence 


S> x(a) =x(x) © x(h). 


a=x mod m’ heH 


The restriction of y to the subgroup H being a character of H, two cases are 
possible: 

If x is not a character mod m’, the restriction of x to H is a non-trivial character 
of H. So all partial sums (18) of the Gauss sum I, (x, 7) are trivial by orthogonality 
relations (11). Hence x(n) = 0, and thus we also have 


(9.20) Tn (1, x) = Im) x(n), 


which proves propositions (a1) and (a3) of the theorem in this case. 

If x is a character modm’, then x = 1 on H and the sum of all y(h) equals 
Card(H) = Card[G(m)]/ Card[G(m’)] = v(m) /p(m’). Then the value of the partial 
sum over all a= xmodm’ is Card(H)x(x)e(xn'/m’) and 


(9.21) Im(n, x) = e(m) /p(m') S>x(a)e (an'/m’) , 


where summation is mod m’ over all « € G(m). But if x’ is the character of G(m’) 
from which y arises and each x € G(m) is replaced by its image x’ in G(m’), then 
x(x) = x/(x’) and e(xn'/m') = e(2’n'/m’) since x’ = x modm’. Thus relation (21) 
becomes 


n(n, x) = (m)/e(m') ST x'(2')e (2'n'/m’) = 
x'€G(m’) 


= v(m) /e(m’) Dw (n’,x’) . 


However, by definition of m’, (m’,n’) = 1. So (18) can be applied to the new 
situation. Hence 


(9.22) Pm(n, x) = Dnt (x')x'(n'‘)e(m)/e(m’) , 

with 

(9.23) Dmi(x') = S> x'(a’Je (2'/m') , 
Gm’) 


which proves proposition (b) of the theorem. 


(v) Case of the unit character. As a useful illustration, for the unit character 
x take: x(x) = 1 if « € G(m), x(x) = 0 otherwise. Then y’ = 1 as well, hence 
x'(n') = 1 since (m’,n’) = 1. Setting 


(9.24) Iin(n) = Im(n,1) = » e(na/m) = In(—n) 
(a,m)=1 
and 
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in order to simplify notation, we therefore find 


(9.25) Im(n) = ont u(m/d) if d=m/(m,n). 


Sums (24) satisfy 
m if min, 


(9.26) Shaw ‘4 


d|m 


otherwise . 


Indeed if the restriction (a,m) = 1 is omitted in sum (24), we would get the sum 
yo e(na/m) extended to all a € Z/mZ. As at e(na/m) is an additive character 
which reduces to the unit only if m|n, the result obtained would be equal to the 
right hand side of (26). But, in the sum extended to all roots of unity, let us 
group together the terms according to the value d of (a,m). Setting m = dm’ and 
a = da’, we get (a’,m’) = 1 and e(na/m) = e(na’/m’). So the contribution from 
these classes equals I,,;q(n). As d can take any value, including d = m which 
corresponds to the term a = 0 of (24), (26) follows. 
For n = 1, (26) becomes 


(9.26’) do u(m/d) = YF ud) = ; : ou 


d|m d|m 


To compute this Mobius function, named after the inventor of the strip, first observe 
that (1) = 1 by (24) and 


(9.27.1) L(p) =—1 if p is prime 

since 1 and p are the only divisors of p. If m = p", the divisors of m are 1, p,...,p", 
and so 

(9.27.2) u(p')=0 ifr>2. 


In the general case, first observe that 
(9.27.3) (ma',m") =1 => w(m'm") = w(m’) w(m") . 


Since d = d’d”, we can identify divisors of m = m'm” with ordered pairs of divisors 
of m’ and m”. If the relation we need to show is proved for all < m= m'm”, then 
u(d) = p(d')u(d’) except possibly if d = m, in which case d’ = m’, d” =m”. But 
then (26) becomes 


pm) + SD wd’yu(a") =0 
d'|m! 


d!'|m!" 


provided the pair m’,m” of the sum is omitted. On the other hand, multiplying 


relations (26) for m’ and m”, the same relation is recovered except that u(m) is 
” v 


replaced by p4(m’)u(m"). Hence (27.3) holds by induction on m = m'm”. 
To summarize: 
1: oe: 
(9.28) u(m) = (-1)* if m=p1...pp with distinct p; 


QO in other cases . 
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The Mobius function — as well as many other analogues — is related to the 
Riemann ¢ function. Indeed, let us compute 


S$ 1/n? So u(m)/m? = S> p(m)/(man)* 


by grouping together terms according to the value k of mn. They are obtained by 
choosing an arbitrary divisor d of k and by setting m = d, n = k/d. The total 
coefficient of 1/k* in the product of the two series is, therefore, the sum >> u(d) 
extended to divisors of k, which is trivial if k > 2 and equal to 1 fork = 1. Asa 
result, 


(9.29) 1/¢(s) = 55 p(n) 


(Alternatively, we could have used (28) to get the infinite product of the ¢ func- 
tion directly). This supposes Re(s) > 1 if we are keen on convergent series. But 
the computation is purely formal and some authors even turn this into a theory 
analogous to that of formal series }*a,X” and, for example, define some sort of 
associative convolution product 


(9.30) = S> f(dg(m/d) 


d|m 


applicable to functions on integers n > 1. At least it has the not so worthwhile 
merit of satisfying the identity 


(9.31) dE f(m)/m? YF g(m)/m* = $7 f * g(m)/m*, 


m>1 m>1 m>1 
which can be written 
CrSo = reg 
using obvious notation. In particular let us take g(m) = 1 for all m, whence 
m) = 0 f(m/d) = f'(m). 
As Cg(s) = ¢(s), it follows that ¢r¢ = Cy, and so 
Cr(8) = Cpr (s)/C(8) = Cpr(s)Ou(s) = Cpran(s) - 
This implies the Mobius inversion formula: 
(9.32) = So f(m/d) <=> f(m) = S— wld) f/(m/d). 
We next consider the equality > y(m/d) = m satisfied by Euler’s indicator. As the 


function g remains the same, y * g(m) = m for all m. However, the function ¢), 
associated to h(m) = m is ¢(s — 1). Hence ¢,(s)¢(s) = ¢(s — 1), and so 


(9.33) dE e(m)/m* = ¢(s — 1)/6(s) 


m>1 
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We can at last apply the method to the function f(m) = Im(n) = Im/(|n|) for 
given n. Here f’(m) =m if m|n, = 0 otherwise. Thus, setting 


etre > (n/d)* = n*o_,(n) 


d\n 


for all n > 0 gives 


C(s)6r(8) = Cpr(s) = So (In|/m)'~* = o1-s ((n\) - 


m|n 


Therefore, we finally get the formula 


(9.34) Yo Fin(n)m~* = 01-5 (In) /G(8) = n° *o5-1 (In) /6(s) 


m>1 


which will be used to get the Fourier series expansion of the “reduced” Eisenstein 
and Maaf series. 


10 — The Series 07(x; x) and L(s; x) 
(i) Functional equation of 65 (x; x). We can now come back to the series 
=o x(n)f (na/m?) 
defined in (9.12) and to relation (9.13) 
mx +(x; x) = lent (n, x)g (n/mzx) 


with g(x) = f (x/m?) and g(x) = m2 f (wm?). It can be written 


mx ¢(x; x) =m? Se Eat (n,x)f )f (n/m? a) : 


If y is a primitive character modm, theorem 5 enables us to replace Im(n, x) by 
Im(x)x(n). So, replacing « by 1/a and setting 


(10.1) (x) = Din(x)/m? , 


(10.2) Os (1/2; x) = e(x)@O 7 (@;X) - 


Replacing f by f in (1), replaces the function f by + f(—2). So, if x is also 
replaced by the conjugate character Xx, then 


97 (2X) = €(X) x ‘Os (-1/2;x) - 
Thus, substituting in (1), 
Os (1/a; x) = (xe (X) Of (—1/a3 x) . 
As x(—n) = x(-1)x(n), 
(10.3) 0; (—2; x) = x(-1)65 (2; x) 
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for all x and f. Since the functions 6¥(x; x) are not all trivial for given x, it follows 
that 

(10.4) e(x)e(x) = x(-1) 

for every primitive character. But 


= So x(@e(a/m) = S> x@eCa/m) = Tn (Hl, x) = x(-D In) - 


Hence 

10.5) €(®) = x(-Debd 
as well, so that (4) becomes 

10.6) lex) = 1 


or, taking (2) into account, 


10.6’) [En(x)| =m? . 


As an example let us choose the even function 1/ ch 7, which is identical to its 
Fourier transform and suppose that x is primitive and y(—1) = +1. Then, by (1), 
the function 


(10.7) flax = ys x(n) / cos (mnz/m?) , Im(z)>0 
satisfies 


(10.8) f (-1/25x) = O)(@/i) F(X) 


for pure imaginary z = iz, hence on all of the half-plane, which generalizes the 
formulas found in n° 4 in the case x(n) = 1 for all n € Z. 

We could also have chosen the function 2 + exp(mix?z) where Im(z) > 0. 
Its Fourier transform is x +> (z/i)~2 exp(—mia?/z) and writing (1) for « = 1, we 
conclude that the function 


(10.9) =Sox(n x(n) exp (rin ?z/m) = = 0(z + 2m; x) 
neZ 


satisfies 


(10.10) 6(—1/2; x) = €(x)(2/A)28 (23%) 


by assuming y to be primitive and even. 


(ii) The series L(s,x). To generalize the calculations of §1, n° 2 concerning 
the Riemann series to the series 0 r(x; x), it is necessary to study their asymptotic 
behaviour as |t| tends to +00 or . 

Assuming f(—x) = x(—1)f(a#) and noting that (0) = 0 for any character 
modulo an integer, m > 1, 


1 
(10.11) Or (ask) = YE x(n)g(n2v), 
n>1 
where g(x) = f(2/m2). As g is rapidly decreasing at infinity and as |x(n)| = 1, by 


1 
3 
the ee ef : 1, n° 2, the result is rapidly decreasing at infinity. 
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In the neighbourhood of 0, the relation 


0; (a3 x) = ae ere n,x)g(n/mz) , 


where summation is over n € Z, shows that the left hand side is the sum of a 
function O(|a|%) for all N and of the additional term 


(ma)~'9(0)Im (0, x) - 


But Im(0, x) = >> x(a), where summation is over all a € G(m). This is the inner 
product (9.8) on L?[G(m)] of y and the unit character. If x is the unit charac- 
ter, then Im(0,x) = v(m), i.e. the number of elements of G(m). Otherwise, the 
result is trivial because of the orthogonality relations. Getting rid of the case of 
the unit character, which anyhow is not primitive, shows that the function 6+(z; x) 
is O(|z|") for all N in the neighbourhood of 0. As a result, the integral defin- 
ing its Mellin transform converges for all s € C and so is an entire function of s 
(Chap. VIII, n° 13). 
A formula calculation shows it is equal to 


[O° astevroa°are = 2D xim) f 


n>1 0 


+00 1 
f (na/m? ) vd'r= 
(10.12) = m°/°T;(s)L(s; x), 
where I'y(s) is the Mellin transform of f and where the Dirichlet series 
(10.13) L(s;x) = >> x(n)/n 

n>1 


converges absolutely for Re(s) > 1. In this domain, term by term integration is 
justified because of 


> | |t (na/m4) x*|d*a < +00, 


which amounts to the convergence of the Riemann series since the integral defining 
the Mellin transform of f € S(R) converges absolutely on Re(s) > 0. Hence this 


calculation shows that, if x is not the unit character, the product m*/?I;(s)L(s; x) 
is the restriction of an entire function of s to the half-plane Re(s) > 1. 
As this assumes f(—2) = x(—1)f(a), one can for example choose 


(10.14’) f(a) = exp (—12?) = f(z) if x(—1) = +1, 

(10.14”) f(x) = naexp(—1a?) =if(x) if x(-1) =-1. 

As shown by the calculations of §1, n° 1, the first case leads to the function 
(10.15’) A(six) = (n/m) *PP(s/2)L(s;x), x(-1) = +1; 

and in the second one, multiplying the result by the cosmetic factor m2 gives 
(10.15”) A(s;x) = (/m)~ OF? (1 + 8)/2] L(s;x), x(-1) =-1. 

As 1/I(s) is an entire function, so is the series L(s; x). If x is even, it has zeros 


at s = 0,—2,... like 1/I°(s/2). If xy is odd, it has zeros at s = —1,-—3,... like 
1/L[(1 + s)/2]. 
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Taking into account the functional equation 


Oy (1/23 x) = €(x)xOy (x X*) , 
which holds if x is primitive, finally leads to the next result: 


Theorem 6. Let m be an integer > 1 and x a primitive character modm. Then 
the function 


= So x(n)/n 


is the restriction of an entire function to the half-plane Re(s) > 1, and the function 
A(s;x) defined by (15’) or (15”) satisfies the functional equation 


(10.16) A(1 — 83x) = €(x)A (s;k) if x(-1) = +1, 
(10.16”) A(1 — 8; x) = —ie(x)A (s;k) if x(-1)=—-1. 


As a first example, let us take m = 3 and the character x(n) = 1 ifn = 1mod3, 
x(n) = —1 if n = 2mod3. This is the only possibility in this case. Then 


(10.17) L(s;x) = 50 1/(8n + 1)? — S°1/(8n + 2)° 
where summation is over all n € N, including 0. As 

= 5° x(a) e(a/m) = exp (21/3) — exp (4777/3) = 324, 
€(x) = 1%. Hence, since ¥ = x, (16”) shows that 

A(1 — 8;x) = A(s;x), 

where 

A(s;x) = (n/3)~ OP (1 + 8)/2] L(8;x). 
The reader who is not very familiar with these calculations should completely redo 


them in this particular case. 
For m = 4 and the character used at the end of n° 1 of §1, 


Ta(x) = e(1/4) — e(3/4) =i — #8 = 21 = 423, 


whence e(x) = 7. As x(—1) = -1, 
ACs; x) = (4/4) OF PTL + 8)/2)L(s; x). 


Relation (16”) shows that A(s; x) = A(1 — s;x), as obtained in n° 1. 
For m = 12 and the character x used in n° 8, formula (8.6) shows that I2(x) = 


122, whence e(y) = +1. As x is even, formula (16’) applies. Here 


L(s;x) = >) 1/(12n +1)* — 1/(12n +5)° — 1/(12n +7)° 4 
(10.18) +1/(12n +11)*, 


where summation is over n > 0. 
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Like the Riemann ¢ function, the functions L(s;x) have an infinite product 
expansion 


(10.19) L(s;x) =[[-x()/p'", Re(s) >1, 


extended to all prime numbers not dividing m since y(p) = 0 for the others. Indeed, 
for all p, 


[1 — x(p)/p*] L(s;x) = >> x(n)/n® — S° x(pn)/(pn)’, 


which removes from the series all terms for which p divides n. The end of the 
argument ends is like in the case of the zeta function. (19) shows that the series 
L(s;x) are £0 for Re(s) > 1. 

(19) does not suppose x to be primitive. If f is the conductor of x, i.e. the least 
divisor f of m such that x is a character mod f, and if y’ denotes the primitive 
character mod f from which y arises, expansions (15) of L(s;y) and L(s; x’) are 
the same except that factors corresponding to p coprime to f but not to m appear 
in L(s; x’), but equal 1 in L(s; x). Hence 


(10.20) L(s;x') = L(s;x) [[ 1 -x'@)/e"]" » 


where the product is extended to all p dividing m but not f. As theorem 6 applies 
to the left hand side, it can be deduced that, even if x is a primitive character, the 
L(s;x) series extends analytically. 

Finally note that there is a property of the series L distinguishing them from 
the Riemann ¢ function: if x is not the unit character, the series L(s;x) converges 
uniformly”? in the half-plane Re(s) > o for all o > 0 whereas, for the ¢ function, 
o > 1 would need to be assumed. 

These series having been invented by Dirichlet, it is not surprising that the 
result can be obtained by the arguments used in Chap. III, §3, n° 11 to prove the 
convergence criterion by the same author. Recall that it is about a numerical series 
> UnvUn whose partial sums are assumed to be bounded and the sequence vn > 0 
to tend decreasingly to 0. These assumptions are too restrictive. In fact, un and un 
may be taken to be complex, in which case the assumptions to be checked are 


(1) the sums s, are bounded, 
(2) limvn =0, 
(3) Slongi — eal < +00. 
Indeed let us set tn = uivi +...+UnUn. The proof reduces to showing that, for all 


€ > 0, |t¢ — tp_1| < € for sufficiently large p and q. To this end, as in Chap. III, let 
us write 


tq — tp—-1 = —Sp—1Up + Sp (Up — Up4i) +... + 8q—1 (Vq—1 — Va) + Sq¥q- 


Assumption (1) provides an upper bound 


|tq — tp-1| < M |vp| + M |vg| + M (vp — vp4al +... + 
(10.21) + |vg—1 — Uql) 


25 Recall that, for a series of functions, uniform convergence is by definition equiva- 
lent to that of its partial sums. It is obviously guaranteed by normal convergence, 
but does not hold for the L-series. In the case of a series of holomorphic func- 
tions, uniform convergence ensures that the sum also converges uniformly and 
its derivatives can be computed by differentiating the given series term by term. 
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assumption (2) shows that the first two terms tend to 0 and assumption (3) that 
so does the third one, whence the general theorem. 

In the particular case at hand, set un = x(n) and vp = 1/n*. The first assump- 
tion holds . As x(n +km) = x(n), the sum of x(n) as n varies from km to (k+1)m 
is indeed equal to the sun of x(x) extended to x € G(m), and so is trivial since 
x is not a unit character. Hence setting n = mq+r with 0 < r < q, 8n = Sr. 
As a consequence, there are finitely many possible values for s, and a constant M 
independent of the parameter s. 

To show that convergence is uniform in the half-plane P defined by Re(s) > 0 
with o > 0, it is necessary to show that, for all « > 0, there is an integer N 
independent of s such that 


sE€P & p,q>N => tq —tp-1| <e. 


As |un| = 1/nR°) < 1/n7, the first two terms of the right hand side of (21) do not 
create any problems. Finding an upper bound for the differences 


(n+1) 8 =n“ [1—(14+1/n)~*] 


s 


Un —Unt1 =n 


amounts to finding an upper bound for |(1 + z)~* — 1) for z = 1/n. Newton’s 
binomial formula shows that (1+ z)~*° —1 = —sz+O(1/z*) as z tends to 0, whence 
Un — Un +1 = O(1/nst!). This result is sufficient to prove convergence, but not 
that the latter is uniform. The derivative of the function t ++ (1 + tz)~° being 
—sz(1+tz)~°~1, TF shows that 


1 
\((1+2)°-1| =| s | (14 tay ay <|sz|. sup [1 +tz|7 Pe? 
) O<t<1 


For z = 1/n, |1 + tz| varies between 1 and 1+ 1/n. As the exponent is < 0, sup is 
attained at t = 0. Thus 


o-l1 


lun — Unga < |s|n7 BO“? < [s\n 


in the half-plane P and so 


-1 


|Up — Upsil +... + |vg—1 — Uql < |s| (P-7 +...+¢q°77") : 


As the series }>1/n°t* converges, Cauchy’s criterion for it gives the expected uni- 
form upper bound. 
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11 — Liouville’s Theorems 


Lett wi and w2 be two complex numbers whose ratio is not real and L the lattice 
of points w = niwi + new2 with ni,n2 € Z. An elliptic function attached to the 
lattice L is a meromorphic function on C satisfying 


f(u+w) = flu) 
for all w € L, for example the series 57> 1/(u —w)* with k > 2 and 


gr (u) =1/u? +> [1/(u—w)? - 1/w?] 
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of Chap. II, n° 23, which we will encounter again later. The general properties 
of these functions are set out in theorems due to Liouville (1844), Hermite and 
Weierstrass, now easy to prove. If f is elliptic, then obviously so is f(u + a) for 
all a, as well as the derivatives of f. Any function P(fi,...,fn)/Q(fi,---, fn), 
where P and Q are polynomials and the f; are elliptic functions is again an elliptic 
function. In fact, it will be shown further down that any elliptic functions is a 
rational function of g@ and ¢’. 

This theory is the simplest non-trivial illustration of the expansions on Riemann 
surfaces given in Chap. X. First of all, the quotient S(L) = C/L of the additive 
group C by the discrete subgroup L can be considered as a compact group [Chap. XI, 
n° 15, (vi)]. It is compact since there are compact sets K C C having non-trivial 
intersections with all classes mod L, for example the parallelogram generated by wi 
and wz. To define a Riemann surface structure on S(L) , it must be equipped with 
an atlas (U;,;) whose chart changing maps are holomorphic (Chap. X, n° 1). To 
this end, we consider open sets W in C whose images W + w under translations 
by elements of DL are pairwise disjoint. For all z € C, any sufficiently small disc 
centered at z clearly answers the question. Since the canonical map 7 from C onto 
C/L transforms every open set into an open set [Chap. XI, n° 15, (iii)], it maps 
each W homeomorphically onto an open subset of C/L. Hence there is an inverse 
homeomorphism y from 7(W) onto W, and thus a chart 7(W). If W and W’ are 
two such open subsets and y, y’ the corresponding maps, for all ¢ € 7(W)Nz(W’), 
there exists w € L such that y’(¢) = y(¢) +, and as there clearly exists only one 
w mapping a sufficiently small disc centered at y(¢) onto a disc centered at y'(C), 
the change of chart map reduces to a translation z > z+w on every sufficiently 
small open subset of (W) M 7(W’), qed. 

With this definition, holomorphic functions on an open subset U of C/E are 
the same as holomorphic functions defined on the L-invariant subset 1~'(U). The 
notion of poles, of the order of a pole or of a zero, etc. is the same as in C and in 
C/L. In particular, elliptic functions are just the meromorphic functions on C/L. 

Henceforth, when used without further explanations, the letter w will denote 
an arbitrary element of LD. 


Theorem 7 (Liouville, Hermite). All entire elliptic functions are constants. 
For any non-constant elliptic function f, 


Se) SO, S = Res(f,a) = 0. 


Needless to say that summations are extended to all classes mod L. The former 
result is obvious even without knowing Cauchy theory: it suffices to expand the 
given function into a Fourier series with respect to one of the periods (Chap. VII, 
n° 17) and to write the relations that coefficients have to satisfy so that it admits 
the other period. The latter, as indeed the former, is just a restatement of theorem 
1 of Chap. X, n° 1. 


Corollary 1. Let f and g be two elliptic functions such that va(f) = va(g) for all 
a€C.Then f and g are proportional. 


For f/g is an elliptic function without any zeros or poles. 
Corollary 2. There are no elliptic functions of order 1. 
For such a function would have a unique simple pole with non-trivial residue. 


Corollary 3. An elliptic function has as many zeros as poles. 
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The zeros and poles are obviously counted mod L and their multiplicities taken 
account of. The total number of zeros or poles, or more generally of the equation 
f(u) =c, is called the order of f. 

Applying this corollary to f — c, where c is a constant, shows that f is of order 
p, and the equation f(u) = c has p roots mod L, as usual taking their multiplicities 
into account. For example, an elliptic function of minimal possible order, namely 
2, has only the following possibilities: 


(i) two simple poles and two simple zeros, 
(ii) a double pole and two simple zeros, 
(iii) two simple poles and a double zero, 
(iv) a double pole and a double zero. 


On a Riemann surface, it is necessary to consider meromorphic differential 
forms as well as functions. If w is such a form on S(L), its inverse image wo 7 is 
clearly (Chap. IX, n° 16) a meromorphic differential form f(z)dz invariant under 
zt— z+w, which is equivalent to saying that f is an elliptic function. As, in the 
neighbourhood of every point of C, the form f(z)dz expresses @ in a local chart 
of C/L at 2(z) =a, the order va(w) at a defined in Chap. X, n° 1, is just va(f). 
Hence v(w) = v(f) = 0, and as the genus g of a compact Riemann surface is defined 
by the relation v(m) = 2g — 2, we conclude that the Riemann surface C/L is of 
genus 1. One can show that all compact Riemann surfaces of genus 1 are obtained 
in this way. 

Finally note that C can be considered to be the universal covering space of the 
Riemann surface C/L (Chap. X, n° 3). 


12 — Elliptic Functions and Theta Series 


There has always been two possible points of view concerning the construction of 
elliptic functions: that of Abel and Jacobi, historically the first one, on the use 
of theta series and infinite products representing them and that of Eisenstein and 
Weierstrass, based on partial fractions expansions. Most textbook authors follow 
the second one, which is easier as it spares readers Abel and Jacobi’s calculations. 
This advantage is in fact an inconvenience, for as was seen in the preceding §§ 
these calculations have other uses. When they have been understood, they lead 
to existence theorems more quickly than Weierstrass’ methods. We will therefore 
present both points of view starting with that of Abel and Jacobi in this n°. 


(i) Abel’s theorem. Abel and Jacobi infinite products studied in § 2, n° 6 enable 
us to find all elliptic functions through an extraordinarily simple procedure. 


Theorem 8. Letai,...,Qn be the zeros and bi,..., bn the poles mod L of an elliptic 
function f, counted with their multiplicities. Then 


(12.1) So ak = Joby modL. 


Conversely, relation (1) ensures the existence of an elliptic function f whose only 
zeros and poles mod L are the points ax andby. This function is unique up to a 
constant. 


Necessity of the condition. Let us set g = f’/f and h(u) = ug(u). The function g 
being elliptic, h(w+w) = h(u) +wg(u). Consider the period “parallelogram” given 
above. It is somewhat deformed so that f does not have any poles or zeros on its 
boundary. 
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Cc 


Fig. 2. 


[rendu [ h(ujdu =f udu f h(u bon) du= 


= —we - g(u)du 


and as there is a similar relation between integrals along BC and DA, 


; h(u)du = —wi d —w d : 
(12.2) Joop, hodua men fh altaon fais 
Since f(A) = f(B) = f(C) = f(D), the next lemma applies: 


Lemma. Let : [0,1] —> C be a C' path and f a holomorphic function on an 
open set containing (1) and without zeros in (I). Suppose that the values of f at 
the endpoints are equal. Then 


[alterna 


By definition, the integral to be calculated is obtained by integrating along the 
inverse image I = [0, 1] under yp of the differential form df/f. Setting y(t) = f[u(t)], 
it is, therefore, a question of integrating dp/y = y’(t)dt/y(t) from 0 to 1. However, 
the map t +> g(t) is a path y in C, and the integral to be calculated is, by 
definition, just that of dz/z along 4, i.e. the variation of a uniform branch of Log z 
along y [Chap. VIII, § 2, n° 4, (i)]. If (0) = f[H(0)] = f[u(1)] = ¢(1), the path + 
is closed. So this variation is an integer multiple of 277, qed. 

This being settled, (2) shows that the integral of h along ABC'DA is of the form 
niu +n2we, up to a factor 277i, i.e. is in L. But the poles of h in the interior of this 
contour are the zeros and poles of f. At a point a where the series expansion of f 
starts with a term in (u—a)*, that of f’/f starts with a term k/(u— a) and that 
of uf’(u)/f(u) with the term ka/(u — a), whence Res(h,a) = ka. So the sum of 
the residues of h in P(a) equals the sum of the zeros ax minus that of the poles bx, 
repeated as many times as their multiplicities since each point a where the order 
of f is k, which can be positive of negative, contributes an amount equal to ka to 
the result, proving the necessity of (1). 

Sufficiency of the condition. The solution is clearly unique by corollary a of 
theorem 7. 
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To simplify calculations a bit, the proof can be reduced to the case where the 
lattice L is generated by 1 an z with Im(z) > 0. To see this, it suffices to observe 
that the imaginary parts of w1/we and w2/wi are of opposite signs, which enables 
us to assume that z = w2/w is in the upper half-plane. Then the elliptic functions 
of the given lattice L are clearly the functions f(wiu), where 


fut+tl) = f(u+z) = flu). 


On the other hand, the given a, and by are clearly only involved through their class 
mod L. To construct the expected function f, one can, therefore, suppose that 


(12.3) So an = So dk. 
This being so, let us set 
(12.4) q=exp(miz), w=e(u) = exp(27iu) 


and start from the Jacobi identity (n° 6, theorem 4) 


ae) . 2 
0(u; z) = , er z+2rinu = ) ge w” = 


(12.5) =][@-@") G47" uw) (14+ ¢"'w"). 


A trivial calculation using the series shows that 
(12.6) O(uti1,z)=O(u;z), O(ut+z)=q 'e(—u)O(u; z). 


Since 6(u, z) is, moreover, an entire function of u (Chap. VI, § 4, n° 20, theorem 18), 
clearly, for all c € C, the function 


(12.7) f(u) = T]@ (w= an +52) / J] 8 (w= be +52) 


is meromorphic and 


fut) =u), fez) =e(Yoae— dod) Flu) = fw) 


by (3). The points a, and by remain to be shown to be the only zeros and poles of 
f, counting multiplicities. 

However, the zeros of the infinite product @(u,z) are the values of wu where 
w=q°""! for some n € Z, i.e. such that 


u=womodL where wo = ae +2). 
For two different factors of the infinite product to be simultaneously zero would 
require g?”~!w = 1 for two (not necessarily positive) values of n. But then w would 
be a root of unity, and so |g| = 1, which is impossible. As moreover the roots of 
the equation e(w) = c are simple, the zeros of 0(u, z) are u=wo mod L and are all 
simple. 

Choosing c = wo shows that the zeros of the numerator (resp. denominator) 
of the right hand side of (7) are, modulo L, the points a, (resp. bg) with the 
correct multiplicities. The zeros of the numerator cannot cancel out those of the 
denominator since all az are distinct from all b, mod L. So the function 


(12.8) f(u) = [] 0 (= ax +4052) / JT] 8 (w= bx + 0; 2) 
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answers the question upon the choice of (3), which is always possible, for a, and 
bp, qed. 

Using the function @1(u; z) defined in (iii) further down, the result can be put 
in the much simpler form 


(12.9) [1 (w= aes 2) /T] A (w= bas 2) 5 


which Hermite availed of in the 1860s. He was one of the first to systematically 
apply Cauchy’s methods. 

Without more ado, some authors attribute the previous theorem to Abel, who 
used an infinite product very similar to that of Jacobi. Abel and Jacobi, and Gauss 
before them without publishing, could obviously show that formula (8) provides 
some elliptic functions and, through explicit calculations, those that were already 
known. But how could they have proved that it gives all of them in the 1820s, 
when Cauchy’s theory, barely invented, were practically unknown, starting with 
the precise notion of an analytic or a holomorphic or a meromorphic function? 
For example, Christian Houzel says”® that « without in any way using Cauchy’s 
methods to study one variable complex functions », in 1844 Liouville proved that a 
doubly periodic functions has as many zeros as poles, that the sum of its zeros is 
equal to the sum of its poles and that if f is an elliptic function of order 2, any 
« meromorphic» function admitting the same periods is of the form P(f)+Q(f)f’, 
where p and Q are rational fractions. He also says that Hermite was the first who 
introduced in 1848 Cauchy’s methods by integrating along the boundary of a period 
parallelogram, Cauchy showing in 1851 that all this follows easily from his residue 
theory. In fact, referring to Cauchy was not necessary to implicitly ensure that, 
in the neighbourhood of each point a € C, elliptic functions are power series in 
z—a possibly divided by a power of z — a: this is how a rational fraction behaves, 
a fact then long well known, and this is what Liouville implicitly implied when 
talking of «functions of x + ./—ly». For example to show that (8) represents all 
elliptic functions, it then suffices to show that an elliptic function without poles 
is a constant. As I mentioned in Chap. VII, n° 18 this Liouville knew how to do 
using Fourier series. As for Abel and Jacobi, they seemingly found it sufficient to 
have shown that their miraculous infinite products made it possible to compute all 
elliptic integrals and functions that they give rise to. 

Finally, note that the Jacobi series has the advantage over infinite products of 
converging far more rapidly because the exponents of q are the squares 1, 4, 9, 
16, etc.. This makes numerical computations easier for practical purposes. Jacobi 
was the first to have observed this in the case of the “ Poinsot type” movement of 
a heavy solid body with a fixed point, a problem of mechanics leading directly to 
elliptic integrals. 


(ii) General theta functions. The previous proof shows the importance of having 
available entire functions, the “general theta functions”, which, instead of being 
doubly periodic, satisfy relations of the form 


(12.10) f(u+1) =e(au+b)f(u), fut 2) =e(cu+a)f(u) 
with given constants a, b,c,d. There is then obviously a more general relation 


f(ut+w) =e (aru + be) flu) 


6 in Jean Dieudonné, Abrégé d’histoire des mathématiques 1700-1900 (Hermann, 
1978), tome II, pp. 21-22. 
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for all w € L, so that the zeros of such a function can be divided into finitely many 
classes mod L. The same is also true for all products and quotients of meromorphic 
functions satisfying identities such as (10). 

If P(u) = au? + bu +c is a second degree polynomial, then 


P(u+1) = P(u)+2auta+b, P(u+z) = P(u) + 2auz + az? + bz, 


so that f(w) = exp[P(u)] is a theta function, so-called trivial for obvious reasons. It 
has no zeros, and this property characterizes the trivial theta functions since (10) 
shows that (f’/f)’ is an elliptic function without any poles and so is constant if f 
has no zeros. It is then immediate that multiplying a solution of (10) by a trivial 
theta function, one may reduce to the case where 


(12.10’) fut+lh=f(u), f(ut+z)=e(cu+a)f(u) 


with coefficients c and d different from above, which makes calculations somewhat 
easier and as will be seen, enables us to determine all the solutions of the problem. 
Replacing wu with u + 1 in the second relation (10), we first observe that the 
function e(cu+d) must be of period 1. Hence it is necessary to assume c = —p € Z. 
On the other hand, the first relation (10’) means (Chap. VII, §4, n° 17) that f 
is a Fourier series 


(12.11) f(u) = S > ane(nu) 


converging normally in every compact set. The second relation (10’) is equivalent 


to 
x ang’ e(nu) = e( d)S~ ane [(n — p)u] = e(d) ye An+pe(nu) . 
So 


Ontp = e(—d)q?" an 


and more generally, after a short calculation,?” 


Ontkp = ane(—kd)q® *— Dp t2kn 


for all not positive and negative k € Z. Decomposing series (11) into partial sums 
(12.12) fn(u) = so are(ru) = e(nu) os An+kpe(pku) , 
r=n mod p keZ 
gives 
fn(u) = e(nu) ) > e(—kd)g**-Y?*?*" (pu) = e(nu) S~ q?* e(kv) , 
keZ 
up to a factor a,, and so 


(12.13) fr e(nu) rage (kv) = e(nu)O(v; pz) , 
kEZ 


27 There are many trivial but tedious calculations in this theory that I will not give 
details of, leaving it to the reader to do so. 
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where 
(12.14) v=pu-—uotnz, uo =d+pz/2. 
This calculation however assumes that 


2 
S- lal?* +(2n—p)k < +00. 
kez 


Since, for large |k|, the exponent |q| is equivalent to pk’, the series converges in the 
same conditions as 


oy exp (nipzk*) : 


i.e. for Im(pz) > 0. As Im(z) > 0, tt is, therefore, necessary to assume p > 1 to get 
non-trivial solutions. 

As the Jacobi series satisfies (9), it is conversely to see, after some short calcu- 
lations, that function (13) satisfies 


(12.15) fut =flu), — f(w+z) =e(-pu +a) f(u) 
for all n. The coefficients an (0 << p—1) can be chosen arbitrarily, from which 
one deduces that, for c = —p, the solutions of (10) form a p-dimensional complex 


vector space. In fact, this assumes that the p functions f,(u) are known to be 

linearly independent, but (12) shows that the same function e(ku) never occurs in 

two different series (12). Linear independence then follows from the fact that a (real 

or complex) Fourier series can be identically zero only if all its coefficients are zero. 
For p = 1, the previous calculations shows that all solutions of 


fut)=fu),  flu+z)=e(-u+d)f(z) 


are proportional to 0(u — uo; z), where uw) = d+ z/2. 


(iii) Metamorphoses of the Jacobi series. The Jacobi series has fascinated dozens 
of mathematicians in the 19th and also in the 20th century. Starting with Jacobi 
himself,?* they have proved numerous identities where it occurs. Jacobi, and much 
before him Gauss in his secret papers, singled out four particularly important func- 
tions (in all cases summation is over n € Z): 


28 The reader can refer to his memoir Ueber unendliche Reihen deren Exponenten 
zugleich in zwei verschiedenen quadratischen Formen enthalten sind (CEuvres 
completes, tome 2, pp. 217-288) . Dozens of identities between infinite products 
and series can be found in it. They are collected at the end of the article in three 
typographically very dense pages of results. 
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O1(u3z) = —iexp(miu + wiz/4)0 [wt (z + 1)/2; z] 


= —i)-(-1)" exp [ri (n+4)? 2+ (Qn+ L)riu| 


= 2q'/4 (sin zu — q'? sin 37u + q?* sin 5ru — ...) 


02(u;z) = exp(miu + miz/4)0(u 4 


= )oexp [ri (n4 


= Qqi/4 (cos Tu 4 


A3(u;z) = O(u; z) 


= Yoexp (min*z + 2rinu) 


rq 


+ 1)? z+(2n+ Liu] 


2-3 cog5rut.. .) 


— 1+ 2(qcos2ru + q* cos4ru + q? cos6ru +...) 


Oa(u;z) = 0(ut 332) 


= Y\(-1)” exp (win?z + 2rinu) 


— 1 — 2 (qcos2ru — q* cos4ru + q° cos6ru — ...) : 


In my preface to vol. I I wrote that I refused to economize paper. There are nev- 
ertheless limits to be respected, and as a simple but efficient way of limiting the 
length of text is to replace detailed proofs by exercises, here are some. They will be 


referred to later. 


Exercise 1. Prove the following relations, where w = e(w): 


x(a; 2) = g/* J] (1—a") (1- gw) (1-7) 5 


n>1 


bo(u; z) re qgl4 II (1 = ¢") (1 +7"w) (1 +¢" wt) 


n>1 


(12.16) 


q”‘w) (1+q?""*w*) : 


O4(u; z) = Il (1—@") (1-q?" tw) (1-¢?"tw) . 


n>1 


Exercise 2. The table below gives the zeros of functions modulo periods, the 
factors by which u +> u+1 or u+ > u+ z multiplies them by and what they 


become under z+—> —1/z. 


Z6LOS ur >util ze >—l/z 
01 (u; z) 0 -1 ~i(z/i)? exp (miu?z) 0; (uz; z) 
02(u; z) 1/2 -1 (z/i)2 exp (miu?z) 4(uz; z) 
03(u;z) (14+2)/2 1 (z/i)2 exp (miu? z) 03 (uz; 2) 
04(u; 2) 2/2 +1 (z/i)2 exp (wiuz) 02(uz; 2) 
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Concerning the effect of z ++ —1/z, use the Poisson formula 


(12.17) So f(ttn) = >> f(nje(nt 


and the fact that the Fourier transform of exp(it?z) is (z/i)~2 exp(—mit?/z) for 
Im(z) > 0 (Chap. VIL, §6, n° 28). 

Show that the functions u ++ 6%(u;z) are even for k > 2 and that 61(u;z) is 
odd. 

The importance of these series is that their zeros are obvious and they all 
transform under w+ u+1 or ur > u-+ z in the same way, up to sign. 

Exercise 3. Define the Germans’ Thetanullwerte by 


6n(z) =6n(0;2) for 2<k<4. 
Show that 
62(2) = 2q'/*T] (1—a?") (144°)? = Dexp [mi (n +4) 2] , 
(12.18) 63(z) =T] (1—@?") (14 ¢2""1)? = exp (min?z) = 6(z), 
6a(z) = TE (1—9?") (1-91)? = Y(-1)” exp (win?z) 
and that 
(12.19) 94(z)82(z/2) = 2n(z)n(z/2), 


where 1(z) is the Dedekind function (n° 3). 


Exercise 4. Let g\") (z) denote the nth derivative of | u+—> 6%(u; z) for u = 0; 
an abusive but traditional notation. Show that 


(12.20) 6,(z) = 1qi/4 Pe an + 1)qgrt) 
and that 

(12.21’) 61(z) = 2nn(z)? = 2nq'/4 Il i= gs ; 
(12.21”) = 102(z)03(z)04(z) . 


Reduce (21’) to identity (6.10) of §2 and, for (21”), use the identity 


(12.22) [[@+@) G-¢"") =1, 


which is proved by multiplying the left hand side by [[(1 — q”). 
Exercise 5. Justify the table below, which gives the effect of z ++ z+ 1 and 
zt» —1/z on the functions concerned: 


A(z) 42O4(z) — (z/4)3/?04(z) 
A(z)  4202(z) — (2/4) 204(2z) 
Os(z) — a(z) —(z/t) 20 (z) 
Os(z) _Os(z)_——(z/4) 822) 
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where i2 = exp(mi/4). 

Exercise 6. Recover formulas of exercises 1 to 4 of §1, n° 3. 

Exercise 7. Prove (21’) and (21”) by investigating the behaviour of both sides 
under z +—> z +2 and z+ —1/z and by showing that the ratio of one side of 
(21’) or of (21”) to the other satisfies condition (5.5) of §2, n° 5. 

Exercise 8. Using the same method show that 


(12.23) 03(z)* = 02(z)* + O4(z)* 


or that 


(Up Ogh 4 09? a.) (Pag Oe = Og eS 
= (1+ 2q + 2qt +299 +...)* 


Check the plausibility of this relation by a direct calculation of the first few terms 
of both sides. 

Exercise 9. Using the fact that, for c = —p, the set of solutions of (10) has 
dimension p, show that the four functions 6 (u; z)° generate a 2-dimensioinal vector 
space. Using the fact that two general theta functions satisfying the same functional 
equations and having the same zeros are proportional, show that 


(12.24’) O4(z)*04(u; z)? = 03(z)?03(u; z)” — 02(z)?00(u; z)” , 
(12.24” ) O4(z)*01(u; z)? = 02(z)?03(u; z)? — 03(z)?0(u; z)” , 
by first showing that the right hand side of (24’) is of the form c(z)04(u; z)* and by 


calculating c(z) using (23). 
Exercise 10. Show that 


aa ae) (Stamest) =P" 


(12.25”) 03(z)* = 04 (z)/04(z) — 03 (z)/O2(z) 


First check that the left hand side of (25’) is independent of u. To calculate its 
value, put w= 4 and use (21”). To get (25”), use the Taylor series of the functions 
wr On(u; z) at u = 0. 


Exercise 11. Show that (notation of exercises 3 and 4) 
03, (2) = 4ni.dO;,(0; z)/dz. 
Using (25”), deduce that 


(x0"] = 148 > mg”/(1—4")—8 So Amg'm/ (1 - gm) = 


m>1 m>1 


=1+8S enq”, 


where cp is the sum of the positive divisors of n that are not multiples of 4. Show 
(Lagrange, and proved again by Jacobi) that every rational integer is the sum of 
four squares. 


29 There is a very different proof using Eisenstein series, as well as a result related 
to sums of 8 squares; see for example Freitag and Busam, Funktionentheorie, 
Chap. VII, §1. 
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Many other properties of theta series can be found in Rademacher, Topics 
in Analytic Number Theory (Springer, 1973), Chap. 10 and 11, in Whittaker & 
Watson, Modern Analysis, where I found these exercises, in David Mumford, Tata 
Lectures on Theta I (Birkhauser, 1983), in Henry McKean & Victor Moll, Elliptic 
Curves (Cambridge UP, 1997) and in a number of earlier authors. Nowadays, mostly 
analogous series in several variables are studied. They have been used since Riemann 
in algebraic function theory and in other questions, for example in analytic theory of 
quadratic forms with integer coefficients, not to mention the extraordinary identities 
given recently by Macdonald, some of which we now describe. 

They provide series expansions for all functions n(z)%, where n(z) is the 
Dedekind function (n° 3) and N the dimension of a complex semisimple Lie al- 
gebra or, equivalently, of a compact subgroup of some GL,,(R) with finite centre. 
How to classify them and study them using the theory of “root systems” has long 
been known. This can for example be learnt in N. Bourbaki’s books on Lie algebras, 
in Dieudonné, vol. 5 and many other books. The simple ones consist of four series 
of classical groups (unitary, orthogonal with even or odd numbers of variables, and 
symplectic groups) and five exceptional groups with a complicated description. In 
the next formulas, summation is over integer vectors v satisfying certain conditions 
and ||v|/? denotes the sum of the squares of the coordinates of v. A constant co 
appears in each formula; it could be explicitly given by taking the constant term 
into account. We set q = exp(iz). 

For the unitary group with n + 1 variables and n even, 


nl2)""*) = 60 So (vi qllel?/@n+2) 


vi i<j 


nN 


where summation is extended to all v = (vo,...,un) € Z"*! for which 37 u; = 0 
and v; =imodn-+l. 
The other classical groups give similar formulas with N = n(2n+1) or n(2n—1). 
Using the five exceptional groups, it is possible to expand n(z) to the power of 
14, 52, 78, 133 and 248. For example, 


2 2 
Ho =a DD (ww) YD (wtobey toy aliel i, 


U,v i<j i<j<k 
where summation is over u € Z and vectors v = (v1,...,v9) € Z® satisfying 
u=8mod30, vj =imod30(1<i<8), vw =0, 3ut >, =0. 


Macdonald gives other curious formulas, for example 


n(2)'n(3z) y= en > [eed (nose) ) gilli? /12 


t i<j 


where summation is over v € Z? such that v; = imod6 and Sou; = 0. 
Those who think that the age of beautiful formulas is over are mistaken. 
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13 — Eisenstein and Weierstrass’ Point of View 


(i) Convergence of Eisenstein series. Obtaining functions defined on a group G,, not 
necessarily commutative, and right invariant under a countable subgroup*” I of G, 
ie. satisfying 


f(zy)=f(x) forallxeG andallyeTl, 


consists in choosing a function y on G and in setting 
(13.1) f(z) => 5 o(2). 
Indeed 

(13.2) f (ey) = 35 (277) 


then, and since, by definition of a group, the map y +> y’y is a permutation 
of the elements of I’, relation (1) trivially follows. For example, this is what the 
transformation of a function f on R into a function >> f(a + n) in the Poisson 
summation formula is all about. 

A slightly more general situation consists in supposing that I acts on a set X 
and to then search for I’-invariant-functions on X, i.e. satisfying 


f(y) = f(z) 
for all y € I’ and « € X. (1) is then replaced by 


(13.1’) fe) = oe). 


This argument nevertheless supposes that series (1) or (1’) converges uncon- 
ditionally, as this is the only assumption under which it can undergo the same 
transformations as a finite sum. 

Hence, if G is chosen to be the additive group C and I the period lattice L, 
then this leads to the construction of elliptic functions of the form 


(13.3) S> f(u-w), 


where summation is extended over all periods w € L. The series first needs to 
be made unconditionally convergent, which supposes that f tends to 0 sufficiently 
rapidly at infinity. As the aim is to get holomorphic or meromorphic functions, f 
must satisfy the same assumption. By Liouville’s theorem f cannot be supposed 
to be entire. Hence f must be admitted to have at least one pole at a point a, in 
which case if series (3) converges in a reasonable way, it will have poles at all points 
atw. 

Functions f(u) = 1/u* where k is an integer > 0 are the simplest ones satisfying 
these conditions. They lead to the series 


(13.4) pr(u) = >> 1/(u-w)". 


These converge unconditionally for k > 3 as shown by the following argument 
(Chap. I, §3, n° 23). 


3° In practice, the group G is always equipped with a locally compact topology (or 
even with a differentiable structure) and I’ is always a discrete subgroup of G. 
See Chap. XI, n° 15, (vi). 
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Let us start by considering a compact set K C C and the general term of (4) 
when u remains in K. To analyze the behaviour of series (3), terms for which u—w 
is 0 in K must first be omitted, i.e. the points w € K. As there are finitely many of 
them, the sum of these exceptional terms is a rational function of u. For the other 
terms, the relation |w| — |u| < ju — w| < |u| + |w| shows that, for sufficiently large 
|w| S|w| < |u—w| < 2|w| for all uw € K. Normal convergence in all compact subsets 
of (4) is equivalent to the unconditional convergence of > 1/|w|”*. 

To determine the suitable values of k — here k need not be an integer — we use 
the partition of the period lattice L into sets Ly of 8n elements, namely containing 
the points w € L located on the contour of the parallelogram P,, image of the fun- 
damental parallelogram under the homothety with centre O and ratio n. For these 
w€EIn, rn < |w| < Rn with constants r, R > 0 independent of n. However, by the 
general theorems (Chap. II, § 2, n° 18, associativity theorem 13), the unconditional 
convergence of the series )~ 1/|w|* is equivalent to 


~ x 1/|w|* < +00. 


n>0wEeELn 


The partial sum extended to L, having the same order of magnitude as n/n* = 
1/ n*—! the question reduces to the convergence of the Riemann series, whence the 
condition k > 2. 

These arguments still apply if in (3), instead of 1/u*, one chooses a rational 
function f satisfying 


f(u) x 1/u* at infinity with ,k > 3 


hence of the form p(u)/q(u) with d°(q) > d°(p) + 3. The general term f(u —w) is 
again of the same order of magnitude as 1/|w|" and, for large |w|, there is again an 
upper bound |f(u—w)| < Mx/|u—w|* < Mi|w|* valid for all u in a given compact 
set kK. If K is chosen to be an arbitrary closed disc, apart from finitely many terms 
with a pole in K, we get a normally convergent holomorphic functions series in K 
and so a holomorphic sum in the interior of D. In conclusion: 


Theorem 9. Let f be a rational function such that f(u) x 1/u* at infinity, with 
k > 3. Then the series 


(13.5) f'(u) =o f(u-w) 


converges normally in every compact set and its sum is an elliptic function whose 
poles are, mod L, those*! of f, with the same polar parts. 


This is in particular the case of the functions gx defined by (4): they have a 
pole of order k at lattice points and are holomorphic elsewhere. But the theorem 
applies to several other rational functions other than 1 /u*, for example to 1/(u — 
a)(u — b)(u — c) where a, 6, c are distinct mod L. 


(ii) The Weierstrass o-function. The series > 1/(u—w)? diverges and so cannot 
be used to construct an elliptic function with double poles at lattice points. As was 
shown in Chap. II, n° 23, the modified series?” 


31 To get the polar part of f” at a point a, the polar parts of f at all points 
a+w must be added, so that f” may well not have a pole at a. Hence, strictly 
speaking, the poles of f would need to be assumed to be distinct mod L. 

2 In formula (6) and many similar ones, the series are obviously extended to non- 
trivial periods. Some authors use the symbol 5~’ to indicate this explicitly. I 
prefer to trust the reader’s common sense. 
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(13.6) (u) = 1/u? + S~ [1/(u—w)? — 1/u?] 
converges because, for large |w], 
1/(u—w)? =w? (1 + 2u/w + 3u?/w? +...) = 
= 1/w* +wFu(24 38u/wt...). 


If u remains in a compact set K C C, then |u/w| < 4 except for a finite and fixed 
set of periods. Hence, for all w not in K, there is an upper bound 


[1/(u = w)? = 1/w?| < M/lw 


with a constant M independent of u € K. Series (6) from which the finitely many 
terms for which w € kK have been removed is thus normally convergent in K’, which 
shows that its sum is meromorphic on C, with double poles at all lattice points. 

Though correct, the relation o(u + w) = g(u) is no longer obvious. A direct 
proof was given in Chap. II, but as a similar problem arises in other cases, the 
situation may as well be generalized: 


Lemma. Let I be a group acting on the set X, f a function defined on X andc a 
function defined on I’. Suppose the following conditions hold: 
(i) the series 


(13.7) g(a) = 2 [fy2) - ey] 


converges unconditionally for alla € X ; 
(ii) the series )“[c(yyo) — c(y)] converges unconditionally for all yo € I ; 
(iii) for all yo € I of finite order and all y € T’,°? 


(13.8) lim c(770) =0. 


|n|=+00 
Then g(yx) = g(x) for ally € IT andallxe X. 
As yo € I’ is given, first of all, 


9 (yor) = >> LF (y7102) — e(y)] = 
= SOI (102) - e(y70)] + S2 fe (v70) - e(y)] - 


This formal calculation is justified by assumptions (i ) and (ii). The map y+—> yo 
being a permutation of the elements of I’, the first series has sum g(x). So the proof 
reduces to showing that 


33 Tf y is an element of a group I’, the set of powers y” (n € Z) of 7 can, depending 
on the case, be finite or infinite. The order of y is by definition the cardinal 
of this set. Example: in the multiplicative group T of complex numbers with 
absolute value 1, the roots of unity have finite order and the other elements 
infinite order. In the modular group SL2(Z) of 2x 2 matrices with integer entries 
and determinant 1, which acts on the upper half-plane by z —> (az+b)/(cz+d), 
the matrix S = € =) has order 2 and the matrix T = € 1) infinite order. 
They define the transformations z —> —1/z and z —> z+1 that we have already 
extensively used. 
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(13.9) S~ [e (70) — e(7)] = 0 


for all yo. 
This result generalizes the “obvious” formula 
(13.10) So [f(n +1) — f(n)] =0 


neZ 


whose proof is immediate in the case that is going to be useful to obtain the Lemma. 
If the series converges reasonably, its sum is the limit of its partial symmetric sums 
f(n+1)— f(—n). This gives (10) if f(n) tends to 0 as |n| increases indefinitely.*4 
Another useful case is that of a periodic function f, i.e. satisfying f(n+p) = f(n) 
for a given integer p. As the general term remains unchanged under n +—> n+ p, 
formula (10) is not well-defined in this case and it needs to be replaced by 


(13.11) S> [f(m+1)- f(n)] =0. 


nmod p 


The proof reduces to checking that 


fC) — £0) + F2) — FO) +... + fp) — f(p- 1) = 90, 


which is obvious since f(0) = f(p). 

Coming back to (10), let [> denote the set of 99, n € Z. Writing yIo for the 
set of elements yy’ with y’ € Ip (coset modIo), we get a partition of . The 
associativity theorem for unconditional convergence then shows that series (10) 


becomes 
SS DS [elo’n0) -e(7)] , 


V0 y'erTo 


where summation is first over elements yy’ of some coset yIp, then over the set of 
these cosets. As 7’ +> v7" is a bijection from I onto yI, the previous series can 


also be written 
S> YS lela’) - e(r7’)] - 


yo y"’Elo 


Hence it suffices to show that 
> Le (770) — (17"")] = 9, 


where summation is over 7" € I, i.e. over the set of powers of yo. Setting y"” = yf 
and f(n) = c(y7), the general term of the series becomes f(n + 1) — f(n). This 
reduces the proof to (10) if yo has infinite order since the elements yy of I are 
then pairwise distinct, and assumption (iii) of the lemma ensures the conclusion. If 
yo has finite order p, then®® 7% = e, the identity element. Hence f(n + p) = f(n), 
which reduces the proof to (11), proving the lemma. 


34 In Bernoulli’s days, it would probably have been noticed that each term f(n) 
occurs twice with opposite signs (for example if f(n) = n...). The argument 
of the text shows that sum (10) equals f(++-oo) — f(—oo) if f(n) tends to finite 
limits as n tends to +00 or —oo. 

> If an element g of a group G has finite order, its powers are pairwise distinct, 
and so g* = g° with distinct integers a and b. Hence there are integers p > 0 such 
that g? = e. If p is the smallest one among them, the elements g° = e,g,...,g?~1 
are pairwise distinct and the other powers g” reduce to them by replacing n with 
the remainder of its division by p. The number p is thus the order of g. 
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Example: The function 


f(z) =VWz+> 0 [/(@ +n) - 1/n] 


on C —Z has period 1: make the additive group Z act on C by translations and set 
c(n) = 1/n if n £ 0, c(0) = 0. The function f is meromorphic on C, its poles are 
obvious and f(z) — 7 cotg mz is entire with period 1. To show that f(z) = 7 cotg xz, 
the difference would need to be shown to tend to 0 at infinity, a non-obvious exercise. 


Coming back to the Weierstrass function (6), the lemma can be applied by 
choosing X = C with its periods removed since the series is not strictly speaking 
well-defined if w € L, and I to be the additive group L acting on X by translations 
u-> utw. The simplest function c®* is then given by c(w) = 1/w? if w 4 0, 
c(0) = 0. As was seen earlier, assumption (i) holds. Assumption (ii) means that 


$5 |1/w +0)? — 1/u?| < +00, 


where the two terms that are not well-defined are omitted. This is already known 
since the general term is O(1/w*). Finally, assumption (iii) means that lim 1/(w + 
nwo)? = 0 for wo # 0, which is clear. Formula (6) thus defines an elliptic function. 
We already know that its only singularities are double poles at lattice points. 

This property more or less characterizes the function go. Indeed if y(u) is an 
elliptic function whose only singularities are double poles at points of L, there is 
an expansion y(u) = a/u” +6+... in the neighbourhood of the origin because 
the residues of y are zero by theorem 7. The elliptic function y(u) — ag(w) is then 
entire, hence constant. In conclusion: 


Theorem 10. The series?” 


(13.12) gr (u) =1/u? + S— [1/(u = w)? — 1/w] 


converges normally in every compact set and its sum is an elliptic function whose 
only singularities are double poles at lattice points. Any other elliptic function with 
this property is of the form agr(u) + b where a and b are constants. 


As was seen in Chap. II, n° 23, and as is now obvious because of Weierstrass’ 
theorem on holomorphic function series (term by term differentiation), for k > 3, 
the functions 


pr(u) = 5 1/(u—w)* 
are proportional to the derivatives of ¢: 


(13.13) g'(u) =—293(u), " (u) = 2.3ea(u),.... 


Note that as u tends to 0, the terms of series (12) with the term 1/u? removed tend 
to 0. As this series converges normally in the neighbourhood of the origin, its sum 
also tends to 0. Therefore, the function 9 being even, there is a series expansion of 
the form 


36 The choice c(0) = 0 as well as that of c(e) in the general lemma are is irrelevant. 
Modifying c(y) for a finite set of values of 7 adds a constant to function (7), so 
does replacing c(y) by c(y) + c’(y) where )> |c'(7)| < +00. 

37 The notation oz is used to point out the dependence with respect to the lattice 
L. It will always be omitted when there is no ambiguity. 
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p(u) =1/u? +eur+.... 
Calculating its coefficients is easy: the series appearing on the right had side of 


(12) being normally convergent in the neighbourhood es £: the power series of the 
functions it is composed of can be added term by term.** However, for w 4 0, 


1/(u—w)? =w?(1—u/w)* =w? N° (n+ Iu 
= s Co Dar fat. 


Thus 

(13.14’) ox(u) = 1/u? + oe Aon” 

with 

(13.14”) don = (2n+1) S01 /w°"*? = (2n + 1)Gon42(L), 


where summation is over non-trivial periods. As shown much earlier (Chap. II, 
n° 23, Chap. VII, n° 18), as a result, the function ¢ satisfies the differential equation 


g’(u)” = 49(u)® — 20a2e(u) — 28a4 = 49(u)* — g2e(u) — 93 


in conventional notation. The proof consists in showing that the polar parts of both 
sides coincide at the origin, and that, as a consequence, the difference between the 
two sides is an elliptic function without any poles, hence constant, and in checking 
that it is trivial by calculating the terms independent of u in the power series of 
both sides of (14”). 

In the case where the lattice L is generated by 1 and z, with Im(z) > 0, let 
y(u; z) denote the corresponding function gz. Then 


dan = (2n + 1) >> 1 /(ez+ d)?"*? — (Qn + 1)Gan4a(z) , 


where summation is over all ordered pairs of integers (c, d) # (0,0). As will be seen, 
the Eisentein series Gan(z) play a basic role in the theory of modular functions 
because of the functional equation 


G,, [(az + b)/ (cz + d)] = (cz + d)"Gn(z) 


satisfied by all transformations of SL2(Z). The function (u; z) remains unchanged 
if z is replaced by z+ 1, but an immediate calculation shows that 


(13.15) z *@(—u/z;—1/z) = p(u;z). 


(iii) The series S\ n?/sin? x(u + nz) and Go(z). For a less trivial example of 
the construction of elliptic functions that makes elliptic functions out of functions 


38 The coefficients of the power series of a holomorphic function in the neighbour- 
hood of 0 are its successive derivatives at the origin. However, one can pass to the 
limit in the derivatives of a uniformly convergent series of holomorphic functions 
(Weierstrass). The power series of a uniformly convergent limit (resp. series) is 
thus given by the formal calculation that would have seemed obvious to Euler... 
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that are not, consider the function 7? / sin? wu. It has period 1 and double poles at 
points n € Z. In a given compact set K, the terms of the series 


(13.16) f(u;z) = So 1? /sin? r(u + nz) 


are all holomorphic except the finitely many that have a pole in K. Hence, if (16) 
converges normally in all of kK C C — L, then the result will be of the form 


F(u; 2) = a(z) (uj 2) + e(z) 
with the constants appearing in the Laurent series 
f(u;z) = a(z)/w? +ec(z) +... 


of f at the origin. 
To analyze convergence, observe that setting as usual g = exp(miz) and w = 
exp(277iu), 


f(u;z) = an? 1 [ (gro! gw") . 
Z 


If u stays in a compact subset of C — L, then r < |w| < R withO <r < R< +00, 
and so |qrw? — gw | > |r? |q|” — R2\q\~”| # 0 since |q| < 1. As n tends to 
+oo, q” tends to 0 and |q|~” to +oo. Thus 


er ih Dia tad 
qrw? —q Haga > 5 FR? lal for large n, 


and so |1/(q"w2 - q"w-?)| < 2R-2\q\", which shows that the series with terms 
n > 0 converges normally. The proof remains the same for the series extended to 
n <0. 

To evaluate the constants a(z) and c(z), observe that, for any normally con- 
vergent series of meromorphic functions, the Laurent series of the sum can be 
formally calculated from those of the terms of the series. For n # 0, the term 
a /sin* r(u+ nz) contributes its value 1/sin? mnz at u = 0 to the calculation of 
c(z) and nothing to the calculation of a(z). For n = 0, we use the formula 


—2 


nr? / sin? TU = 7 (xu — ru? /6 ane) =u + ru?/3+... 3 


which gives a(z) = 1 and a contribution 17/3 to the calculation of c(z), so that 
e(z) = 47/34 > n / sin? ANZ. 
nA~0 
We thus get the formula 
(13.17) p(u;z) = s nm / sin? m(u+nz) — 17/3 — ye nm / sin? NZ. 
Z nZ0 
It is not unrelated to that of Weierstrass. Indeed®? 
39 Differentiate the relation 


1 1 1 
t =— y =] 3 
7 cotg Tu ae (+2) 


nA~0 
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nm /sin? ru = S01 /(u—m)? 


Z 
for non-integer u € C, and so 


ae = Ss So 1/(u= m= nz)? : 


neZ meZ 


We recover the series )>1/(u—w)? except for a “detail”: its unconditional sum is 
not well-defined, whereas a convergent result is obtained by first summing over m 
then over n. We can then write 


p(u;z) = SoS 1/(u m—nz)? — 17/3 


(13.18) -S> So1/(mt nz)’ . 


n40 mm 


Grouping together similar terms of both series, we find 


p(u;2) =1/u +> > [1 /(u oye 1 /w?| x? /34 Sy : 


wH0 m0 


This is Weierstrass’ formula since )> 1/m? = 17/3. (18) is generally written as 
(13.19) 9(u; z) =S>S01/(u- m= nz)? — Go(z) 


with a function 


(13.20) Go(z) = aes 1/(m+ nz)? = 77/34 yo /sin? TNZ 


nm nZ~0 
(m,n) F (0,0) 


similar to the series G2,(z) (n > 2), but which should be used cautiously since the 
double series does not converge unconditionally. The next section will show that 
permuting the summation order actually changes the result. 


(iv) Relation between ~ and 6, functions . Suppose the lattice L is generated 
by 1 and z with Im(z) > 0. To construct the Weierstrass function let us return to 
the power series 0;,(u; z) introduced in n° 2, (iii). We know their zeros (all simple 
because of infinite product expansions) and their transformation formulas under 
ur >utlandur >u+z (n° 12, Exercise 2). These immediately show that the 
functions 


fr(u) = 0; (u; z)?/ 01 (u; zy (k = 2, 3,4) 


are elliptic and their only singularities are double poles at the period lattice points. 
Hence 


fir (u) = an(z)@(u; 2) + be (z) = an(z)/u? + de(z) +... , 


with coefficients depending only on z. These are calculated by considering the Lau- 
rent series of the functions f, at the origin. Setting 


On (u;z) = On(usz)/On(z) for 2<k<4, 
Oi (us z) = O1(u; z)/01(z) 
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in order to get functions in the neighbourhood of 0 equivalent to 1 for k > 2 and 
to 1/u for k = 1, we find 


(uj z) = O(u; z)?/ OF (u; 2)” — 0 (z)/ Ox(z) 


where recall that 6/(z) denotes the second derivative at u = 0 of the function 
ut— O%(u;z). This result is not unrelated to Exercise 9 of n° 12. 

Another method amounts to observing that, given a general theta function f (wu) 
satisfying relations of the form 


f(ut+1) =exp(az+b)f(u), f(u+z) =exp(cz + d)f(u), 
its logarithmic derivative g = f’/f satisfies 
gu+1)=glu)+a, glu+z)= glu) +e, 


so that h = (f’/f)’ is an elliptic function. 
For example let us choose the odd function 6; (u; z) which is non-trivial at lattice 
points. Writing 


01(u;z) = a(z)u + B(z)u® + 


in the neighbourhood of 0 gives g(u) = 1/u+... and hence h(u) = —1/u2+... So 
there is a relation of the form 


[81 (us z)/ O1(u; z)]" = —@(u; z) + (2) 


with a function c(z) which, as will be seen, is the same as the one used to obtain 
(17). Since 


f(u) = O(u;z) =] -@") Q-@"w) (1-9? Pw) 


with w = exp(2miu) and differentiating with respect to u,w’ = 2niw, (w7')! = 
—2riw™t, 


f'(u)/f(u )=-2ni 57? "w/ ( 1-¢ "w) + 


n>1 


(13.21) +2ni 5° wt / (1- gw") . 


n>0 


Differentiating again with respect to u, it follows that 
(f “(u) /f(u )) =40P7 S30? "ew / ( 1-¢ Mw)? + 
(13.21’) +40? S> twit / (1g? w"1)” 


and 


n>1 
2 need =n i 7 
+4r Sol (a'w 2—q w?) a 
n>0 
= 4n? ss 1 /[2isina(u + nz)? + 
n>1 


+4? S~ 1 /[2isin m(—u + nz)? 


n>0 
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So finally, 
[04 (us z)/ 01 (us z)] =— Sox? /sin? m(u+nz). 
Zz 


This is function (16). Using (17), c(z) = —G2(z) follows and so 


(13.22) [04 (us z)/ O1(u; z)]’ = —(u; z) — Go(z). 
This result enables us to calculate the effect of z ++ —1/z on the function 


Ge(z). Let us denote by h(u; z) the left hand side and, in (22), replace z by —1/z. 
Taking into account the relation 


61(u; —1/z) = —i(z/i)? exp(miu?z)64 (uz; z) 


and the factor z? which occurs when the last term of the right hand side is differ- 
entiated twice with respect to u, we get 


h(u; —1/z) = 27h(uz; z) + Qmiz = 2? [—(uz; z) + Go(z)] + 2niz. 
However, by (15), 9(u; —1/z) = 2*@(uz; z). So relation (22) for —1/z becomes 


2? [o(uz; z) + Ga(z)] — Qmiz = 27 (uz; z) + Ge (-1/z) , 


given the expected formula 
(13.23) Go(—1/z) = z?Go(z) — Qmiz. 


It is due to Eisenstein who gave a direct elementary proof,*® but full of traps. 
Definition (20) shows that 


Go(-1/z)=2 S> S01 /(n— mz) =2 5°30 1/(mt nz)’ , 


(the term m = n = 0 is obviously omitted), so that we again get the series G2(z), 
up to the factor z”, but the order of summation has be changed. This operation, 
which would not change anything if the double series converged unconditionally, is 
not innocuous: indeed (23) can be put in the form 


(13.23’) So So1/(m+nz)? =So S01 /(mt nz)? = 2ni/z. 


Exercise 1. Show that, applying the relation 1/(1 — z) = >> z™, which supposes 
|z| < 1, to the terms of the two series appearing in (21) gives a double series 
converging unconditionally if and only if u is in the horizontal strip 

B:—Im(z) < Im(u) <0. 
Permuting the summation order, show that 
(13.24) go(u; 2) = —Go(z) + 4? > qe gam e(rne). 
m0 


4° See for example Freitag and Busam, pp. 390-393. 
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This is the Fourier series expansion of the function u ++ g(u;z).*1 What is the 
Fourier series of g(u) in the strip pIm(z) < Im(u) < (p+ 1) Im(z), where p € Z? 
Exercise 2. Using the identity 


Ss na” /(1— 2”) = ss ne = ye a™ (1-2)? 


n>1 m,n>1 m>1 
valid for |a| < 1, as well as the infinite product of the function n(z), show that 
(13.25) Go(z) = —4rin'(z)/ n(z), 


n being the Dedekind function. Deduce* that 


Ge (—1/z) = 2°Go(z) — 2niz n ( 1/z) = (z/i)? n(z). 


(v) Elliptic functions with given simple poles. They can be obtained either using 
Abel and Jacobi’s infinite products, or the Eisenstein- Weierstrass method. We still 
assume that L is generated by 1 and z. 

The first one, presented above, is the simplest: we fix the zeros ax, and the poles 
be (1 < k < n) mod L beforehand and consider the function 


(13.26) o(u) =] ] % (wax) / 01 (u— dx) « 


It clearly answers the question if it is elliptic. Invariance under u +> u+1 is 
obvious. Due to formulas of n° 12, exercise 2, u+> u+ z multiplies the function 
by a factor 


[let =x) - 2/21/T]el (u — by) 2/]=e(Soax— Sob) - 


Therefore, supposing }> a, = >> by, mod Z is sufficient to obtain the result. As we 
anyhow need to suppose > ax = >> bk mod L, the condition can always be assumed 
to hold. 

This method allows us to impose poles and zeros, but not residues. It is possible 
to construct an elliptic function with simple poles mod L at given pairwise distinct 
points a1,...,@n, with given residues ox at these points, but we then have no infor- 
mation about its zeros: everything cannot be imposed at the same time. Hermite 
only remarks that, if 


dee =9, 
a compulsory condition by theorem 8, then the function 
C+ > on64 (u — ar; z)/ 01 (u— ak) , 


where C’ is a constant, clearly answers the question (i.e. is elliptic) thanks to the 
transformation formulas given in the exercises of n° 12. 


“1 Tt holds in strip (24) where it is holomorphic of period 1 and diverges elsewhere. 


#2 This is for instance what Neal Koblitz does in Introduction to Elliptic Curves 
and Modular Forms (Springer, 1993, 2d. ed.), p. 121 to get (23). 
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The lattice L now being arbitrary, Weierstrass starts from the rational function 


(13.27) f(u) = Si On /(u— ax) 


satisfying the imposed conditions but which is not elliptic and makes it elliptic 
using the series 


(13.28) fE(u) = So flu-w) = 2 ox (u-w— an) « 


For it to converge, one must make sure that f(u) decreases sufficiently rapidly at 
infinity. At a first glance, f(w) ~ c/w where c = )> ox. Hence one must assume that 
> ox = 0, a not very surprising condition. Then for large w, 


ok (u—w — ax)? = —onw* [1+ (u—ak)/w]* = 
= 0x [1/w—- (u — an)/w? + (u — ax)? / w? — aed at 


Adding the results for 1 < k < n, the terms in 1/w disappear since 5+ 0, = 0 and 
the coefficient of 1/w? reduces to > o,a%. To make sure that (28) converges, it is 
also necessary to suppose 


(13.29) c= ye Onde =0; 


which does not augur well. Let us in particular suppose that n = 2. The two residues 
are opposites and (29) means that a1 = a2, in which case the function f, hence 
also f”, is identically zero... Nonetheless, there are elliptic functions with only two 
given simple poles a, and az as can be seen by using theta series. 

The general lemma used above to construct the Weierstrass function explains 
the mystery. For n = 2, f(u) = 1/(w— a1) — 1/(u — a2), up to a constant factor, 
and so for given u and large w, 


f(u-—w) =c/w? +O (1/w*) 


with c = a1 — a2. Thus the solution, for c 4 0, is to define f” not by (28), which 
diverges, but by 


(13.30) = ifu-w)— ew] , 
where c(w) = c/w? if w 4 0, c(0) = 0. Conditions (i), (ii) and (iii) of the general 


lemma can be readily checked and this gives the expected elliptic function. 
More generally, when starting from the function 


f(u) = S> ex /(u— ax) with cea 0h 


the necessity of condition (27) to ensure convergence of the series 5> f(u — w) is 
dealt with in a similar way. As we saw, for large w, 


f(u-—w) =c/w? +O (1/w*) where c= S > anor : 
Hence we apply the general lemma by choosing the function 


c(w) =c/w? ifw0, c(0)=0, 
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which leads to a function 
(13.31) f"(u) = flu) + 9° [f(u—w) - c/w] 
wA0 
with simple poles at ax +w, and given residues ox, and holomorphic elsewhere. It 


is unique, up to an additive constant. 


(vi) The functions Cr and oy. Let us continue our variations on the same the 
with Weierstrass and start from the function f(u) = 1/u. The series 1/(u—w) does 
not converge. For large w, 


1/(u—w) = —1/w(1 — u/w) = - (1/w4 u/w? + u? /w? 4 = 
= -1/w—u/w 4 O (1/w*) , 


so that the series 


¢L(u) = 1/ut+ >> [1/(u—w) +1/w4 u/w?| = 
— L/ut >> (w/w? +u?/w* +...) a 


(13.32) =1/ut > Gon(L)w" 


n>2 


converges and represents an odd meromorphic function whose singularities are obvi- 
ous. This why this type of formula is important. Because of the presence of the term 
u/w?, depending on u, the general lemma cannot be applied to prove invariance 
under u»—> u+w. Term by term differentiation gives 


13.33) —¢r(u) = pr(u). 
Thus, for any period, 

13.34) Cr(u+w) = Cr(u) + nr (w) 
with a constant 


13.34’) nu(w) = 2Cz (w/2) if w/2€¢ L 


since Cy is odd. Obviously, 
nL (w! +") = nL (w’) +nb (w”) 


for all periods w’ and w”. 
Exercise 3. If Im(w2/w1) > 0, then went (wi) — winn(w2) = 277. 
The function ¢, enables us to put formula (25) in the form 


(13.35) fr(u) = S— onCr (u— ax) 


because the sum of the terms g,/w which should appear in the last sum of (25) is 
0 since > gx = 0. By relation (34), the periodicity of the right hand side of (35) is 
obvious. 

Another function related to gr and ¢r is the infinite product 


(13.36) or(u) =u] ] (1 —u/w) exp (u/w + u?/2u”) = -o1(-u) 
wA0 
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invented by Weierstrass to obtain an entire function with simple zeros at all points 
of the lattice L and 4 0 elsewhere. Convergence is immediate since the general term 
is 

(1—u/w) (1+ u/wtu?/Qw? +...) (1+u7/2u? +...) = 

= (1—u/w) (l+u/wt+u?/w* +...) =14+0 (u?/w®) . 


Differentiating (3) logarithmically gives 
(13.37) o1(u)/or(u) = Cr (u) 


and as a result, 


(13.38) pr =—(or/or)’ . 


Comparison with (22) suggests a simple relation between functions o and @; and, 
at the very least, an infinite product expansion of o*? very different from (36)... 


14 — Elliptic Integrals 


(i) The field of elliptic functions. Weierstrass was apparently the first to obtain the 
next result: 


Theorem 11. Every elliptic function f is a rational function of @ and g’. 


This means that there is a function R(X,Y) = P(X, Y)/Q(X,Y) of two vari- 
ables, where P and Q are polynomial, such that 


(14.1) f(u) = R (@(u), @'(u)) 


for all u € C for which the relation is well-defined, i.e. except for a discrete set of 
values of wu. 

If the function f has poles not belonging to the period lattice L, then let az 
(1 < k < m) denote their representatives mod L and let r; be their multiplicities. 
As the function = gr is holomorphic at these points, the elliptic function 


9(u) = f(u) [J] lo) — @ (ax)]" 


is holomorphic outside L. If it is holomorphic at u = 0, hence on all of C, it is 
constant and the theorem is proved. 
So let us suppose this is not the case. There is a Laurent series 


g(u) = cr fu” + pa /ur* Polis 


in the neighbourhood of 0 with c, # 0 and r > 2 since an elliptic function cannot 
have a single simple pole mod L. If r = 2, the function g(w) — c2(u) has, at most, a 
simple pole, and so is again constant. If r > 3, there are two possibilities. If r = 2m, 
the order of the pole of g can be reduced by subtracting the function c,g(u)”. If r 
is odd, then r = 3m + 2n with n € {0,1}. As g’(u) = —2/u°+4+..., it is obviously 
possible to choose a constant c such that g(u) — cg’(u)™ @(u)” has a pole of order 
<r at the origin. Induction on r implies that a polynomial in g and g’ canceling 
out the pole of g can be subtracted from g, thus making g constant, qed. 


43 See for example Joseph H. Silverman, Advanced Topics in the Arithmetic of 
Elliptic Curves (Springer, 1994), p. 53. 
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Since g” is a polynomial in g, this argument shows that every elliptic function 
is actually of the form 


(14.2) f = Ril) + @'Ra(~) 


where the R; are rational functions of one variable. 

Even if theorem 11 does not hold for all elliptic functions, it is far from being 
restricted to the function 9. One could for example use a function with only two 
simple poles. This is form (2) of R(y, 9’) which is so because of its simplicity. 

This can be interpreted in the algebraic language of commutative field theory. 
For a given lattice L, let K denote the set of corresponding elliptic functions. 
They can be added, multiplied and divided, and these operations, applicable to 
all meromorphic functions on a given domain, obviously satisfy the axioms of a 
commutative field. Among the functions f € K there are rational functions of 9, 
which form a subfield C(g) of K containing C. The element ¢ of C() is obviously 
transcendent over C, i.e. does not satisfy any algebraic equation 4° ang” = 0 with 
coefficients in C: such a relation would force the Weierstrass function to take only 
finitely many values. 

The differential equation of the function g shows that, on the other hand, g’ is 
algebraic over C(p): o’? € C(g). In fact, every f € K is algebraic over C(g). To see 
this, it suffices to write f and f? as in (2) and to remove g’ from the two relations 
that arise. This clearly gives an equation af? + bf +c = 0 with a,b,c € C(@). a, 
b, c can be assumed to be polynomials in ¢ (get rid of denominators), which means 
that there is always a non-trivial algebraic relation P(f,@) = 0 between f and ¢. 
More generally: 


Corollary 1. Let f and g be two elliptic functions having the same periods. There 
is a non-trivial polynomial P € C[X,Y] such that P(f,g) = 0. 


It could be tempting to prove this by constructing a polynomial P for which 
the elliptic function P[f (x), g(u)] does not have poles. This would require intricate 
calculations since f and g cannot have an arbitrary number of poles randomly 
distributed in C , modulo the period lattice. The abc of commutative field theory 
gets totally rid of these difficulties and makes the corollary a very particular of a 
much more general and purely algebraic result.** 


Corollary 2. Every elliptic function satisfies a differential polynomial equation 


PF, f') =0. 


(ii) The Riemann surface of the field of elliptic functions. We saw in n° 11 that 
elliptic functions of the lattice LZ can be identified with meromorphic functions on 
the compact Riemann surface C/L. On the other hand, the functions g and g’ are 
connected by algebraic equation (13.14”), which will be written 


(14.3) C? — 427 + goz+g3 =0 


44 For this consider a field K and a subfield k of K (here take k = C) and assume 
there are d elements x; of K (here take d = 1 and x1 = @) such that every 
element of K is algebraic over the subfield k(a1,...,2a) generated by k and the 
elements x;. This general theorem asserts that there always is a non-trivial alge- 
braic relation P(y1,...,Yya+1) = 0 with coefficients in k between d+ 1 elements 
y; of kK. The reader will find the proof — consisting essentially of linear algebra 
—in Serge Lang, Algebra and several other textbooks. In fact, §§9 and 26 to 29 
of my Cours d’algébre and the corresponding exercises ought to be enough. 
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here so as to follow the notation of Chap. X, n° 4. There is a compact Riemann 


surface X connected to this equation, which except for critical points of (3), is the 
set of solutions (z,¢) of this equation. As (o(u), 9’ (u)) clearly satisfies (3) for u € L, 
the next result arises naturally: 


Theorem 12. The map 
(14.4) j:umod L+—+ (¢(u), o’(u)) 


can be extended to an isomorphism from the Riemann surface C/L onto the algebraic 
Riemann surface X associated to equation (3). 


There would be nothing to prove if we already knew that, when the fields of 
meromorphic functions of two compact Riemann surfaces are isomorphic, so are the 
two surfaces considered. But we will have to forego this general theorem, construct 
the compact Riemann surface of (3) following step by step n° 4 of Chap. X, then 
define j for all u € C and check that, this gives a holomorphic bijection from C/L 
onto X , whose inverse is also holomorphic. In fact, we will construct a holomorphic 
surjective map j from C onto X such that j(u) = j(v) if and only if u = vmod L. 
Passing to the quotient will give the expected isomorphism. 

In accordance with Chap. X, n° 4, we start by removing from C of the values of 
z where the equation in ¢ has a multiple root. These are the roots usually denoted 
by e€1, €2, e3, of the equation 


(14.5 423 — goz —g3 =0. 

They correspond to numbers u € C where g’(u) = 0, As g’ is an odd holomorphic 
elliptic function outside L, o’(u) = 0 if u=—uFZ# Omod L, hence if u=w/2 fora 
period w such that w/2 ¢ L. So, denoting by wi and we a basis of L, 

(14.6 u € {w1/2, w2/2, (wi + we) /2} mod L. 


These three numbers being distinct mod L, 


(14.7 e1 = 9(wi/2) , ex = —(w2/2) , eg =~ [(w1 + we) /2] = —e1 — 2 


since the sum of the roots of (4) is zero. These roots of (4) are simple. Indeed if for 
example e; = €2, then the elliptic functionf(u) = ¢'(u)/2[@(u) — e1] would clearly 
satisfy the equality f(u)? = @(u) — e3, which is impossible since then f would have 
a single simple pole. 

To construct X and the map 7: — ¢, we start by constructing the open 
subset X of X located on top of B = C — {e1, e2, e3}. Topologically it is the set of 
solutions P = (z,¢) € C? of (3) for which z € B, the map a being given on X by 
(2,6) = 2. 

Let us show that, for such an ordered pair, there exists a unique u € C mod L 
for which z = ¢(u), ¢ = g'(u). The function g being elliptic of order 2, the equation 
9(u) = z has two simple roots or one double root, modulo L (corollary of theorem 7). 
In the second case, (uw) = 0, and so z € {e1, e2,e3}, a case temporarily excluded. 
As moreover g is even and g’ odd and as z determines ¢ up to sign, the result 
follows. 

Let 71, 72 and 73 denote points of C where @ takes values e;, e2 and e3, B’ 
the set of u € C distinct from 71, 72, 73 and 0, mod L, and let 92 be the image of 
B’ in C/L, an open subset whose complement consists of the classes of 71, 72, 73 
and 0. We associate the element j(u) of X given by (4) to all u € B’ . Taking the 
quotient mod L, j defines a bijection from 2 onto X. 
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Let us show that 7 is an isomorphism between the non-compact Riemann sur- 
faces X and 2. Let P = (a,a) be a point of X. In the neighbourhood of P, the 
function q(z,¢) = z—a is a conformal representation of a neighbourhood of P on a 
neighbourhood of a since we are on top of a critical point of (3). On the other hand, 
if b € C is chosen so that a = (b) and a = g’(b), the function g is a conformal 
representation of any sufficiently small neighbourhood of 6 on a neighbourhood of 
a since 9'(b) 4 0. The composition of u—> g(u) and the inverse of (z,¢) ++ z-a 
shows that ut > (g(u), o’(u)) = j(u) is clearly also a conformal representation of 
a neighbourhood of 6 on a neighbourhood of (a, a). Thus there is an isomorphism 
between (2 and X. 

j now needs to be defined at the four points of C/L that are not in 2, i.e. at 
U = 71, M2, T3 Or Omod L. 

Let us first analyze the surface X on top of a neighbourhood of a = e1 = ¢(71). 
Let (notation of Chap. X, n° 4) D(a)* be a pointed disc centered at a not containing 
any of the other points e;. The inverse image 7 *[D(a)*] C X is a covering of order 
2 of D(a)*. Let us show it is connected. Otherwise, it would be possible to define 
two uniform branches f(z) of equation (2) (Chap. X, theorem 5 for k = 1) in the 
neighbourhood of a, i.e two solutions of 


f(z)? = 4(z — a)(z — e2)(z — es) 


holomorphic for sufficiently small |z — a| > 0. Such a function f would be bounded 
in the neighbourhood of a, hence holomorphic on a closed disc centered at a and 
zero at a: this is absurd since the right hand side has a simple zero at a. 

Hence this shows that there is a unique point P; of X on top of each critical 
point a = e;. When P € X converges to P;, the ordered pair (z,¢) = P converges 
in C? to (e:,0), which enables us to identify P; with (e:,0) = ((m), '(mi)). Thus 
in accordance to (4), we set j(7:) = P;. This defines 7 on C — L, ie. on C/L with 
the class of 0 removed. 7 is clearly continuous. 

To show that j is an isomorphism in 7;, one needs to use a local uniformizer q; 
at P;. Now, we know [Chap. X, eq. (4.2)] that it can be chosen in such a way that 


(14.8) gi(z,¢)° =2z-e: =u) — (mi) if (2,0) = 5). 


As g'(:) = 0, on the other hand, in the neighbourhood of 7, 


o(u) — (mi) = (u—m)” g(u), 
where g is defined and holomorphic on a neighbourhood of 7; and non-trivial at 7;, 
and so qi[j(u)]? = (u — 7:)?g(u) in the neighbourhood of 7;. As g(m) 4 0, there 
is a holomorphic function such that g(u) = f(u)? on a neighbourhood of 7;. It is 
unique up to sign. Both sides of 


(14.9) gi li(u)]? = (u— mi)” f(u)? 


are holomorphic on a pointed disc D(z;)* since j is holomorphic except at the 
points 7;. Hence, as D(z;)* is connected, if need be replacing f by —f, 


(14.10) gi i(w)| = (w— m) fu). 


It follows that the left hand side is holomorphic on a neighbourhood of 7;, which 
shows that 7 is holomorphic as a map from C — L to the Riemann surface Xx. 
Therefore, as j is clearly injective, it is an isomorphism from C — L onto X with 
the points projecting onto co € C removed. 
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To finish the construction of j, let us analyze the surface X on top of the 
point oo of the Riemann sphere. To reduce to finite values of z and ¢, we set 
z=1/2',¢=1/¢', which transforms equation (3) into 


(14.11) z'* — ¢? (4— go2” — gsz*) =0 


which has to studied in a neighbourhood of z’ = 0, a value where it has the double 
root ¢’ = 0. The point at infinity is, therefore, a critical point of the initial equa- 
tion (3). If there were two points of X on top of oo, the algebraic function ¢’ = F(z’) 
defined by (11) would decompose into two uniform branches f(z’) and —f(z’) ina 
neighbourhood of 0, and by (11), one would have f(z’)? = z'%g(z') with g(0) = 1/4. 


This is obviously impossible. So there is a unique point P. of X projecting onto 
the point z = oo of C, with a local uniformizer qx satisfying qouo(z',¢’)? = z’ as 
above or, coming back to the initial variables, 


daa(Z,C)- = 1/z, 


which holds for large |z|. 

To define j at 0, it is natural to set 7(0) = Ps. We still need to show that j 
is an isomorphism from a neighbourhood D(0) of 0 onto a neighbourhood of Ps. 
However, j(u) = (e(u), g’(u)) on D(0)*, and so 


qoo [j(u)]” = 1/p(u). 


As 0 is a double pole of ¢, 1/g(u) = u? f(u)? where f is holomorphic on D(0) and 
not zero at 0. The arguments used for neighbourhoods of the other critical points 
imply that qoo[j(u)] = uf(u), which ends the proof of theorem 12. 


(iii) Addition formula. Corollary 1 of theorem 11 enables us to show for example 
that, if f(u) is an elliptic function, then for all n € Z, there is a non-trivial algebraic 
relation between functions f(u) and f(nu): it suffices to observe that u+—+> f(nu) 
admits the same periods (as well as others...) as f(u). Ever since the second half 
of the 19th century, the study of these relations has given rise to research that has 
powerfully contributed to the development of the theory of algebraic equations, in 
particular Galois theory. Similarly, for all a € C, there is a relation between f(u) 
and f(u+a), in other words an “addition theorem”. Abel and Jacobi were familiar 
with these results. Particular cases were known even before them. These had been 
obtained by explicit calculations sometimes close to wizardry.*® 

If f is the g-function of the lattice L considered, the addition theorem is ob- 
tained very simply. To calculate it, suppose for the moment that 2a is not in L. 
This condition ensures two propositions used in the proof: (a) the only zeros of the 
function g(u) — g(a) are the obvious ones, namely the points a and —amod L; (b) 
g(a) #0 since a has just been shown to be a simple zero of (uw) — (a). 

This being so, the only singularities of the function g(u — a) are double poles 
at points w +a, where w € L. In the neighbourhood of such a point, it is the sum 
of 1/(u — a)? and of a power series in (u — a)” without any constant term. As on 
the other hand g is holomorphic at a, g(u) — g(a) = g’(a)(u — a)[1+...], with 
g' (a) #0 as has just been shown. So the elliptic function 


(14.12) g(u) = p(u — a) — 9'(a)* /[p(u) — g(a)? 


45 See Houzel’s article in Dieudonné’s Abrégé d’histoire des mathématiques tome 2, 
in particular pp. 8-11 that are difficult to understand without referring to sources 
since they present little else but results formulated in the notation of the time. 
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has at most a simple pole at a. Its other possible poles are the zeros of 9(u) — g(a), 
i.e. the points uw = a or uw = —amod L by (a), and these are double poles since 
9’ (a) #0. But as o’(u) is odd, (wu) + e'(a) = 0 for u = —a. To remove the double 
pole at a, it is, therefore, sufficient to replace function (12) by 


(14.13) h(u) = (ua) - [pw +9]? / Alou) - (OP 


Since, in the neighbourhood of u = a, 


[o'(u) + 6'(@)]”/4 lo(u) - @(a))? 


[20"(a) +.. }/4 [e'(a)(u-a)+...]? = 


2 


=(u-—a) “4+. 


and g(u— a) = (u—a)~? +..., like g, the new function A, has at most a simple 
pole at a. 

Thus everything has been done to remove the pole of g at —a, but replacing 
g by hf introduces a singularity at w = 0, which now needs to be investigated by 
writing down the first few terms of the Laurent series of the functions concerned 
at the origin. 

Since o(u) =u? + agu? +... in the neighbourhood of 0, 


p(u) — p(a) =u? [1 — p(a)u? + gou4 +...) , 
[p(u) — g(a)? = u~* [1 — 2p(a)u? +...] , 
(14.14) [0(u) — p(a)|~? = ut [1+ 2p(a)ju? +...] . 
Similarly, '(u) = —2u7? + 2a2u+..., and so 


et 
(14.15) [@’ (uw) + e'(a)]’/4 =e [1 — g'(a)u® +.. A 


Multiplying (14) and (15) sidewise, it follows that 


[o'(u) + e'(a)]” /4[o(u) - @(a))? = uw? [1+ 20(a)u? +...) = 


=u ?42(a)+.... 


As (u— a) = (-a@) + up'(—a) +... = 9(a)—g'(a)ut..., 
(14.16) A(u) = —u-? — g(a) +... . 


Thus the function h(u) + e(u) is holomorphic at the origin, and so at points of L. 
Outside L, it has the same singularities as h, hence at most simple poles at points 
u = amod L. In other words, h + ¢ is an elliptic function of order < 1, and so is 
constant. However, (8) shows that h(u) + o(u) = —p(a)+... in the neighbourhood 
of 0. Therefore, h(u) + o(u) = —(a), and taking (13) into account, we get 


(4.17) p(u-a) = [6'w) + 9'@)]*/ 410&) - o@? - 0) - 0). 
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This result is generally written in its most symmetric form 


(14.18) plu + v) + plu) + e(v) = i Gocom) , 


The proof of (18) was nonetheless based on the assumption that a, or —v in the 
notation of (18), is not a half-period. This restriction is irrelevant since, for given 
u, both sides of (10) are meromorphic functions of v. If relation (18) holds outside 
a discrete subset of C, it does so for every v where both sides are well-defined, in 
other words provided none of the three numbers u,v, u+ v is a period. 

For u = v, the fraction appearing in (18) obviously needs to be calculated by 
passing to the limit, which leads to the duplication formula 


(14.19) p(2u) + 2(u) = "(u)?/ 4p"(u)?. 


Calculating @” by differentiating the differential equation of the g-function gives 


[0(u)? + g2/ 4]? + 2930(u) 


Oty: 4p(u)? — go@(u) — 93 


The geometric interpretation of (18) consists in considering the curve in C? 
having equation 
(14.20) y” = 42 — gor — 93. 
The differential equation of the g-function gives a parametric representation 
(14.21) ur (9(u), @'(u)) 


of it, as was was seen in the proof of theorem 12. If y = mx +p is the equation of a 
line in C?, the x-coordinates of its intersection points with the curve are the roots 
of the equation 


Ax? g2x — 93 — (ma 4 p)° =0. 


Hence, if A = (a,a’), B = (b,b’) and C = (c,c’) are these three points, then 
at+b+c= m? /4, Next let u, v and w be parameters, determined mod L, of the 
points A, B and C. The slope of the line AB being 


m= (8 —a)/ (0-0) = [6'@) —6'w)]/ lol) — oC) , 


we get 


e(u) + e(v) + e(w) = 


Comparing with the addition formula, we conclude that o(w) = g(u+v), whence 
w=+(u+v). As u, v and w can be arbitrarily permuted, u+vu+w = 0. Therefore, 
the addition formula says that the points with parameter u, v and —(u+v) of curve 
(20) are aligned. 

These calculations assume that none of the points u, v, u+v is in L. In particular 
there is no point O on the curve such that A +O = O for all A since it would 
correspond to the value wu = 0 of the parameter. To get rid of this restriction, 
we need to work in the complex projective plane P2(C) and replace (19) by the 
equation 


(14.22) 4x? — y?t — goat? — gat? = 0 
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where x,y,t denote the standard homogeneous coordinates [Chap. IX, §4, n° 11, 
(iii)] in P2(C), which adds a point “at infinity” to the curve having homogeneous 
coordinates (0,1,0). Map (21) can then be defined for all wu € C: if u is not in 
L, it associates the point with homogeneous coordinates (¢(u), e’(u), 1) to u. In 
the neighbourhood of 0, we for example replace these homogeneous coordinates 
by (u2@(u), u2' (uw), u?), and find the point (0, —2,0) at the limit, equivalently, the 
point (0, 1,0) at infinity of the projective curve. Map (21) becomes injective when it 
is interpreted as a map from C/L to P2(C). Its image is precisely curve (22). All this 
is simply a restatement of theorem 12. As the composition law (u,v) —> u+v in C 
passes to the quotient by the subgroup JL, it induces a commutative group structure 
on C/L, hence also on curve (14). Its identity element is then the already O with 
homogeneous coordinates (0,1,0) we have already found, i.e. the only intersection 
point of (21) with the line at infinity of P2(C) and hence an inflection point of 
the curve. If A, B are two points of the curve, the point C = A+ B given by 
the composition law is just the symmetric point with respect to Ox of the third 
intersection point of the line AB (or, if A = B, of the tangent at A) with the curve. 
Exercise 4. Give a geometric interpretation for the equality 


ALB EO) = Ary eC. 


To go further than this brief overview, it is necessary to go deeper into the much 
more general theory of Abelian varieties; a far too extensive program. 
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§5. SL2(IR) as a Locally Compact Group 


15 — Subgroups, Invariant Measure 


(i) Actions of SL2(R) on the half-plane. In the theory of modular or more generally 
automorphic functions, one uses the group*® G = SLe2 (R) of matrices 


o=(2 3 with a,b,c,d€R, ad—be=1, 


and so 


Setting 
gz = (az + b)/(cz + d) 
in order to simplify notation and observing that 
(15.1) Im(gz) = |ez + d|? Im(z) 
has the same sign as Im(z), one sees that G acts on the upper half-plane 
P :Im(z) > 0 


by conformal representations z +— gz. These are the only ones: this is an easy 
classic result. For all z,z’ € P, there clearly exists g € G such that z’ = gz. In fact, 
only matrices with c = 0 need be considered. 

If I is an arbitrary subgroup of G, the stabilizer of any z € P in I is the 
subgroup I’, of y € I such that yz = z. For I = G, I will rather denote it by Ky. 
The reason for this will readily become apparent. Writing z = xi for x € G, one 
need only search for elements g such that gxi = xi, i.e. such that x~gx € K;. This 


subgroup is the set of matrices such that a = d, c = —b, hence of the form 
cos2m7t = sin 2at 
(15.2) k(t) = fe sin2at cos si 


with real t. K;, denoted simply K, is compact and isomorphic to R/Z or T. So in 
the general case, 


K, =aKe' if z=vi. 


This is a compact subgroup like Kk. 

As every z € P is the image of some 7 under a unique g € G mod K, P may 
be identified with the homogeneous space G/K, even from a topological point of 
view. Thus G acts properly on P, and more generally so does any closed subgroup 
I’, for example discrete, of G (Chap. XI, n° 15, theorem 24). In particular, the 
quotient space I’\P is locally compact and for all compact sets A, B C P, the set 
of y € I such that yAN B # 0 is compact, and so finite if I’ is discrete. The theory 
of automorphic functions would be impossible without these two results. 


(ii) Automorphic forms as functions on G. It is convenient to set 


46 Some authors prefer to use the group GL} (R) of matrices with determinant > 0. 
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15.3) J(g;z) =cz+d. 
Thus d(gz)/dz = J(g;z)~*, Im(gz) = |J(g, z)|7? Im(z), and 
15.4) I(9'9:2) = J(9's 92) I(93 2)- 


This last formula is basic. As J(e;z) = 1, it follows that 


15.4’) Ig G2) = G2) 


These functions are needed for functional equations of automorphic functions. 
Given a discrete subgroup I’ de G, for example the modular group S'L2(Z) = G(Z) 
of integer matrices,*’ I will say that a function f(z) — not necessarily holomorphic 
or meromorphic, except in the classical theory — satisfying 


(15.5) f(yz) = Jy, 2)" f(z) for all ye I 


is a generalized automorphic form of integer weight r for I’. For G(Z), the most 
obvious examples are the functions 


Gon(z) = S (ez +d)~” 


of the theory of elliptic functions (§ 3, n° 13). 
Associating the function 


(15.6) Lp(a) f :z-— J(a7*;2)7" f(a *z) 


to every f(z) in P and to every x € G, for each r € Z, one gets linear operators 

L,(a) in the vector space of numerical functions defined on P. By definition, for 
48 

ZyeG, 


L,(2)Lr(y) f(z) = J (e732) " Er(y)f (7"z) = 
Stile 32) Sg ee ly eS 
= JS (y ta s2z)" f(y te *z) = Lewy) f(z), 
whence 
(15.7) L,(a)Lr(y) = Lr(ay). 


If I’ is a discrete subgroup of G, the functional equation 
f (yz) = (cz + d)" f(z) 


47 The reasons for the notation G(Z) are as follows. Let G C SIn(C) be a group 
of matrices g = (gi;) defined by the condition det(g) — 1 = 0 and polynomial 
equations p:(g) = 0 with coefficients in Z. For every commutative ring A con- 
taining Z, one can then consider the set G(A) C SLn(A) of solutions with coef- 
ficients gj; € A. It is a subgroup of SL, (A), a result which is less obvious in the 
general case than in those we will encounter here. The case of an algebraic sub- 
group of GL, reduces to the previous one by associating the “diagonal” matrix 
(x, det(x)~”) of order n+1 to every matrix « € GL,,. For a ring A not containing 
Z, for example a field whose characteristic p # 0 divides the coefficients of the 
pi, it may be that G(A) = SLn(A) or GL (A)... 

48 Recall that, as in Chap. IX, a notation like L,(g~+)f(z) denotes the value of the 
function L,(g~')f at the z. 
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then becomes 
L(y)f=f foralyer. 


All this looks like translation operators of group theory, and in fact, easily reduces 
to them. For example, for r = 0, one can associate the function 


folg)=f(z) if z=gi 


on G' to every function f on P. This gives all the solutions of 


v(gk) = v(g) 


and clearly, 


f(y2z) = f(z) = folyg) = folg)- 


For r 4 0, we associate the function 


_ . : are ai+b 
(158) Sela) = Le (9) = Haid" H0a) = (i +f (SE8) 
to f, where the L, are defined as above. For x € G, let us set ‘hae = L,(x)f and let 
f; be the function corresponding to f’ on G. Applying (8) to f’ and using (7) gives 


f2(g) = Lr (g7*) f'@) = Lr (g7*) Lr(x) f@) = Ly (g7*2) f(@) = fe (27 *g) - 


So the map f +> f, transforms the operator L,(zx) into the left translation operator 


(15.9) L(z)p=a*p grog (a~*g) . 


which brings us back to a situation familiar in group theory and to the “commutative 
diagram” 


L(x) 
fe ee 
ai 
iment 


Since the relation L,(x)f = f is equivalent to L(x)f, = fr, for all y € G, 


(15.10) f'(2) =I 2) "fF vz) = f(g) = frrg)- 


For every subgroup I of G, going from f to f;, thus transforms the solutions of (5) 
into left I’-invariant functions. 

Independently of any subgroup I’, function (8) has another quasi-invariance 
property. Let us consider the subgroup K in G. For matrix (2) with parameter ¢, 


(15.11) J(k;i) = exp(—2it) = e(—t); 


the definition of f, then shows that 

fr(gk) = J(gks i)" f(gkt) = J(g3t) "J(ks i)" f (gt) = J(kt) " fr(g) - 
So 
(15.12) fr(gk) = fr(g)xr(k) forkek, ge€G, 


352 XII — The Garden of Modular Delights 


where the functions 
(15.13) xXr(k) = e(rt) = J(k;i)~" for k = k(t) 


are characters of the compact group K. Reversing the order of calculations shows 
that conversely, for every function y of weight r on G, where a function of weight r 
on G designates a solution of (12), conversely there is a function 


f(z) =J(g;2)"e(9) = (z = gi) 


on the half-plane P such that y = f;. It will sometimes be written yp(z). Clearly, 
the set F,(G) of functions of weight r on G is stable under left translations and 
every function y on G at least slightly reasonable can be represented by a series 
of functions of weight r: it suffices to expand k +> y(gk) as a Fourier series. Let 
H,(G) denote the set of functions of weight r induced by holomorphic functions on 
P. They will sometimes be called holomorphic functions of weight r on G. 

We start from a generalized automorphic form f of weight r for a subgroup I’. 
The corresponding function f, then satisfies 


fr(ygk) = fr(g)xr(k) 


and this relation is precisely functional equation (5). 

We have thereby obtained a result which, judging from some recent books, does 
not seems to be yet adopted by all experts of the classical theory though it has been 
known at least since 1950. Despite being easy — it would take more to trivialize the 
theory of automorphic functions — it ought to be stated: 


Theorem 13. Let r be a rational integer and to every function f defined on the 
upper half-plane P let us associate the function on G = SL2(R) defined by 


fr(g) = J(g34) "Ff (gt)- 


The functions obtained are characterized by the relation 


y(gk) = v(g)xr(k) forallkek. 


For all y € G = SL2(R), the functional equation 


(15.14) f(yz) = S052)" F(z) 
is equivalent to 
(15.15) Fr(yg) = fr(g)- 


In the most classical theory, I" is the subgroup SL2(Z) of integer matrices 
which, as already mentioned, I will often denote by G(Z). But there are many 
other interesting groups I’, in particular the congruence subgroups of the modular 
group G(Z). To define them, we choose an integer N > 1 and, for any matrix 
y € G(Z), let yn denote the matrix obtained by reducing the entries of y mod N. 
This gives rise to a homomorphism 


SL(Z) —> SLo(Z/NZ) 


which is surjective (exercise). Its kernel, written ['(N), is thus the set of 7 such 
that y = 1(mod NV). This being so, the congruence subgroups of G(Z) are those 
containing some I'(N). The group I°(@) of the Jacobi function, generated by z +> 
—1/z and z+ z+4 2, contains ['(2). 
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As G(Z) contains the matrix —1, condition (14) clearly has no solution f 4 0 
if r is odd. But the group I'(NV) does not contain the matrix —1 for N > 3, so that 
r can take any values. 

There is also the much more general class of Poincaré’s Fuchsian groups, which 
will be defined later. They are not characterized by arithmetic conditions, so that 
in their case, results easily proved for G(Z) — convergence of Eisenstein series, 
construction of fundamental domains, etc. — require more complicated arguments. 
This inconvenience is in fact an advantage for those who want to understand the 
subject as it gets rid of ad hoc arguments that cannot be generalized. 


(iii) Subgroups of SL2. Clearly, 
(15.16) J(g;z) =1 forall z = g= ¢ i) = x(u) 


for some u € R. The set of these matrices is a subgroup U of G which acts by 

translations z > z+ u on the half-plane. Instead of x(u) we will often write wu, in 

other words, make no distinction between the matrix x(u) and its parameter u. 
Clearly, by (3), 


(15.17) J (99; 2) = J(g;2) for all z <=> g' EU, 
or 
(15.17’) J (9; z) = J(g;z) for all z = g' CUg. 


Hence the function z > J(g;z) only depends on the coset Ug of g modulo U and, 
in fact, characterizes it. 

We will also constantly need the subgroup’? B of triangular matrices (c = 0). 
It acts on P by similitudes z +> az+ 6 with a real > 0 and b E R. If A denotes 
the subgroup of matrices 


(15.18) Re ({ ea) =n, #0, 


and A, the subgroup t > 0 of A, then clearly, any b € B can be written in a unique 
way as b= hu with h € A andu € U and hAUh~* =U for allh € A: 


(15.19) € ey G a a , = G a) where o(h) =?. 


The subgroups gBg~' of G are called Borel subgroups of G. As any matrix in 


GL,,(R) with real eigenvalues is well known to be the conjugate of a triangular 
matrix, it follows that any g € G satisfying this condition belongs to a Borel 
subgroup. 

Introducing the matrix”? 


(15.20) ie « my aig 


49 Borel (Armand)’s initial, who made much use of it in far less trivial situations. In- 
cidentally, let me mention his Automorphic Forms on SL2(R) (Cambridge Tracts, 
1997), a presentation of the new theory (Maa-Selberg) of non-holomorphic auto- 
morphic functions for a Fuchsian group from a purely group-theoretic viewpoint. 

°° The character S$ is used in the theory of modular functions (z + —1/z); the 
character w, Hermann Weyl’s initial, refers to the theory of semisimple groups. 
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which acts by z ++ —1/z on P, we get 
15.21) G=BUBwB 


( 
(Bruhat decomposition in a trivial case).°' Since B = AU and Aw = wA, every 
g ¢ B can even be written as 


g=uwb withucU, beEB. 


An elementary calculation shows that this decomposition is unique. Analogous sub- 
groups are encountered in all semisimple Lie groups and all linear algebraic groups 
over an arbitrary field. 

Exercise 1. Every matrix g such that c £ 0 can be written in a unique way as 


(15.22) g=x(u') € ) 2G )-s 


Let us give some other ways of writing g € G using subgroups already defined. 
The first one is the Iwasawa decomposition in a trivial case. It consists in writing 
that 


g = bk = uhk 
with be B,heE A,ucU andke€ K. Indeed 


1 
. (1 2\ fy? 0 ee 
(15.23) c+iv=(j i) & py)ice 


for all z = x + ty © P, so that setting gi = z, g is the product of the previous 
matrix, which is in B, and a matrix leaving the point 7 fixed, so in K. (23) can also 
be written 


(15.23’) x(u)hki = ut a(h)i 


foru€¢U,h€ A, k € K, where a(h) = ¢ already occurred in (19). The definition 
of a can be conveniently generalized to all of G by setting 


(15.24) a(g) = Im(gi) = J(g;i)~?. 
Obviously 


5! For a considerably less trivial example, see my Cours d’Algébre, § 15, exercise 23. 
In my young days, when traumatizing secondary school children still went on, 
we used to learn that 


(ad — be) /c? 
azt+d/ce ° 


ax+b _ 
cx +d 


a/c 


from which we deduced that every homographic transformation decomposes into 
two translations, a homothety and the inversion « > —1/a. Exercises 20, 21 and 
22 will be useful for other similar algebraic properties of SL2z (commutator group, 
normal subgroups, definition by generators and relations). These exercises are 
inspired from C. Chevalley who proved all this for “split” semisimple algebraic 
groups over a commutative field in his famous article in Tohoku Math. Journal 
and thus started a theory that has in particular led to enormous progress in our 
knowledge of finite groups. 
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(15.25) a(uhgk) = a(h)a(g) 
for all u,h,g,k and, for z = g'i, 

I(g;2) =F (9:9't) = FI (99's) / I (9st) , 
whence 


(15.26) J(g;2z)? =a (99') /a(g') for z=g't. 


Two other occasionally useful decompositions consist in putting every g € G in 
the form 


(15.27) g= kp = ki hke 


with k,ki,k2 € K, p symmetric positive definite and h € A,. For this observe that 
if g’ denotes the transpose of g, then the matrix g’g is symmetric positive definite, 
and so g’g = p” where p is symmetric positive definite.°? Then the matrix k = p-‘g 
satisfies k’k = 1, proving the first result. It shows that P could be identified with 
the space of symmetric positive definite matrices by associating to a matrix p of 
this type the point z = pi of P. 

On the other hand, any symmetric real matrix can be diagonalized over R using 
an orthonormal basis change in the Euclidean space it acts on. This means that 
p = k5'hke with kz orthogonal, h € A and even h € A; if p is positive definite. 
The first relation (27) thus implies the second one. 


(iv) Fixed points and eigenvalues. The eigenvalues of g € G are the roots of the 
equation 


? — Tr(gjt +1=0. 


They are real and distinct if | Tr(g)| > 2; g is then said to be hyperbolic. In this case, 
IR? has a basis of eigenvectors of g, so that hyperbolic matrices are the conjugates 
in G of the matrices h € A other than € = +1. 

If | Tr(g)| = 2, g is said to be parabolic. So g has a double eigenvalue equal to 1 
or —1. Then there is a vector a € R? such that g(a) = ea with e = +1. Choosing 
b € R? non-proportional to a, g(b) = ua+eb for some u € R. Thus g is the conjugate 
of a matrix u € U, up to the factor e. Therefore, in both cases, g belongs to a Borel 
subgroup of G. 

If | Tr(g)| < 2, g is said to be elliptic. Its eigenvalues are imaginary and g can 
no longer be triangularized or diagonalized over R in this case. To bring back g in a 
standard subgroup of G, we look for the fixed points of g in P, defined for all g E G 
by gz = z. Assume that g 4 +1. The solutions are the roots of the equation 


cz? + (d—a)z—b=0. 


If g est elliptic, then c # 0. This is then a quadratic equation with discriminant 
Tr(g)* —4 < 0 and hence two distinct conjugate imaginary roots. One of them being 
in P, an elliptic matrix is thus the conjugate of an element of K and conversely. 


5 . “ys . . . . . 
52 For any real symmetric positive definite matrix p, there exists a unique matrix 


2 
p2 of the same type such that p = (vp?) . Existence: diagonalize p. Uniqueness: 


diagonalize p2 : 
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(v) Invariant measure. There is a left invariant measure dg on G like in the case 
of a locally compact group. It is actually also right invariant for the simple reason 
that there is no non-trivial homomorphism from G to a commutative group.° 

Since P = G/K where K is compact, there is a G-invariant measure m on P; 
it is unique up to a constant factor and 


(15.28) J otoraa =f amiz) fotamak (= ai) 


for any function y € L(G), or more generally integrable. It can also be obtained by 
transforming every function f(z) on P into the function fo(g) = f(gi) and setting 


(15.28’) [fe )dm(z )= | toloag 


where dg is a left invariant measure on G. The decomposition G = AUK or, 
equivalently, the map b +> bi, enables us to identify By = AU with P = G/K, 
which transforms the action of B; on P into left translations on B+. Considered as 
a measure on B+, m must therefore be the left invariant measure db of the subgroup 
Bx. But for B = AU, one readily checks that it is possible to assume 


[owas /f y(hu)dhdu 


where dh and du are the obvious invariant measures on A and U. Using calculations 
similar to (23), it can be deduced that 


(15.29) dm(z) = y *dady . 


Its G-invariance can be directly checked using the change of variable formula for 
multiple integrals. In fact dm(z) is associated to the invariant differential form 
w=y dz Adz. 

Formula (28’) defines a positive linear functional on the functions f(z). It is 
invariant under the operator Lo(x) for all x € G. To obtain non-trivial integrals 
invariant under all L,(«), it suffices to integrate |f,(g)|? (1 < p < +00) so as to 
obtain strictly K-invariant functions on G. 

As it is then sufficient to integrate over B, the result is obtained by observing 
that, by definition (9) of fr, 


fr(b) =y" f(z) if z= bi, 
IF l=9"? |F(z)| if z =i. 


Hence 

(15.30) [ov P t@lam(e) = f el@lda = Mel 

This expression is invariant under operators L,(x). The same is true for the integral 
(15.30°) 10"), = f vf OF Came) = (F411) 


53 An algebraic proof would require showing that, as an abstract group, G is gen- 
erated by its commutators ryxty~1 
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the inner product on L?(G). Hence generally speaking, one can define spaces L?(P) 
by the condition that 


Hans Petersson** introduced or used these notions without referring to the 
notion of a quotient measure. He defined the inner product of two automorphic 
forms f’ and f” of weight r by integrating over a fundamental domain of I in 
P. Weil’s book, published in 1940 and which German specialists of the time did 
not read, gave the correct solution: every discrete subgroup I’ of G acts properly 
on G or P, I'\G is locally compact, and so is ['\\P = I'\G/K (Chap. XI, n° 15, 
theorem 24). Associating the function 


(15.31) ¢" (9) = >_ (yg) 


to every y € L(G), which gives a map from L(G) onto L(I’\G), defines a “natural” 
measure A on X = I'\G by setting 


y’? f(2)|' dm(z) < +00. 


G 


(15.32) | e@are) = | v(adg. 
r\G 
We will always write this formula in the abusive but convenient form 


(15.32’) he o" (gag = | y(g)dg. 


G 


If one insists on working in the upper half-plane and on defining the inner product 
of two automorphic forms f’ and f” of weight r for I in a natural way, then let 


(15.33) (s'| f”) = a gf OF Cag = ie gh fOr Ean), 


Formula (32) holds for every function f € L'(G): then the series > f (yg) converges 
absolutely almost everywhere and its sum is in L'(I’\G). This is a particularly 
simple application of the generalized Lebesgue-Fubini theorem [Chap. XI, n° 13, 
(ii)]: denoting by a the canonical map from G onto I’\G and associating the measure 


fix: f+» 5° f(yg) where n(g) =x 
to each « € I’\G, relation (32) means that the measure dg of G is the integral of 


the measures jt, with respect to the measure >. Theorem 22 of Chap. XI, n° 13, 
(ii) then shows that f € L1(G) implies that 


Se |f(vg)| < too almost everywhere 


(and, in good cases, everywhere), that the function f"(g) = S> f (7g) is integrable 
over I'\G and that (32) again holds. 


(vi) The point of view of the unit disc. It is sometimes useful to rewrite the 
definitions and results of this n° differently using the conformal representation 


4 ber eine Metrisierung der ganzen Modulformen (Jahresber. Deutschen Math. 
Verein., 49, 1939, pp. 49-75). 
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ze > (z—i)/(z +1) = ¢ of P on the unit disc D. Easy calculations show that 
SL2(R) is replaced by the group G’ of matrices 


(15.34) (; A such that aa — bb = 1, 


K becoming the group K’ of rotations about 0. More precisely, if one chooses a 
matrix s with determinant 1 such that z = s¢ (which determines it up to the factor 
+1), the map 


(15.35) grog =s ‘gs 


is an isomorphism from G onto G’ transforming every subgroup I" of G into a 
subgroup I” of G’. In particular, the matrix k(t) € K defined in (2) is transformed 
into 


(15.36) k(t)! = k(t) = Ge eo) 


as can be easily seen. 
On the other hand, 


(15.37) Im(z) = (1-¢@)/|1-¢)? . 
Since z = i(¢ + 1)/(1-—), 
dz =2id¢/(1—¢)?,  dzAdz= 4d¢Adé/\1—¢|*. 


2 


Multiplication by y~? = |1 — ¢|*(1 — ¢C)~? gives a measure on D invariant under 


group (34), namely 


(15.38) dm(¢) =4(1-I¢)?) * aédn, 


the image of dm(z). 
The functional equation 


(15.39) fz) = Jz)" fF) 


of automorphic forms for the discrete group I" can be easily reworded. Setting 
J(g;z) = cz +d for any complex matrix g with determinant 1 and all z € C, the 
identities 


J (99'32) =I (g:9'2) J (9's2) (gz) = J(g3z) 7dz 
continue to hold. Using (35), relation (39) becomes 
Ff (svg) = Flysd) = Jv; 8)" F(8¢) = Js 6)" J(8; 6)" F(s¢) = 
= J (87'3¢)" J(s3¢) "Ff (s0) = 
= J(837'6)" 7 (50)" J(83 0) F (86). 
Associating the function 
(15.40) fo(Q) = J(s30)"F(s6) = J(850) "Ff (@) = (ss 2)" F(2) 


to f(z), which is holomorphic if and only if so is f, we thereby get the functional 
equation 


§ 5. SL2(R) as a Locally Compact Group 359 


(15.41) fo’) = JV)" fod), 
which replaces (39). Note that, for all f, 
(15.41’) fo) = (2 +4)"f(z), 


up to a factor +V2. 

Everything that has been said above for integer r of operators L,(g) in the 
space of functions on P can be trivially transposed. The formulas are the same. On 
the other hand, (37) and (40) show that 


x? f(z) = (1-71-78, 0" fod). 


As s has determinant 1, 
J(s;¢) "= defdé = 20)? 


So J(s;¢) = (1- ¢)/(2i)?. This is the value of i? that should be used depending 
on the matrix s chosen. Hence 


(15.42) y"? f(z) =o (1- CO)" fod), 


where cp = gr/2(43)". 

In theory, all this assumes r is an integer. For non-integer real r, (40) can still 
be used to associate a function fp defined on D to every function defined on P: it 
suffices to choose once and for all a uniform branch of J(s;¢)~" on D. Relation (42) 
continues to hold if c, is suitably defined. In particular, for all p > 1, 


(15.43) [lw se) Pam(e) = f a= ey" Fo(6)[Pamn(o) 


(up to a constant factor). For p = +00, we replace the integrals by the norms of 
uniform convergence. 


16 — The Discrete Series of Representations of SL2(R) 


In Chap. VIII, n° 12, we defined a space”? Hz(P), for all real r = 2k. It is the set 
of holomorphic functions on P such that 


lfrlly = i, y’!|f(2)|dm(2) < +00. 


One can more generally define spaces H?(P) by setting 


55 The notation adopted here is different from those of chap. VIII: the measure 
written dm(z) in Chap. VIII is drdy. For what follows, the results of Chap. VIII 
need to be applied to the measure dyi(z) = y*~?dady = y"/*dm/(z) in the notation 
adopted here, so that here the function e(y) of Chap. VIII is y*—?. Finally I write 
Hx(P) for the space denoted by H;(P) in chap. VIII, and H2(G) for the space 
of f, where f € H+(P). This is a closed subspace of L'(G). In the maps that 
interest us here, r is an integer, but everything remains valid for all real numbers 
r, provided (34) is taken as the definition of the symbol ||f;,||. The case of a 
non-integer real exponent r occurs when SL2(R) is replaced by its universal 
covering. 
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Ise, = (f 


Il frlloo = SUP 


w?sa"amte))” @< t00), 


y" f(2)| < +00. 


For integer r, the spaces which correspond to them on G by theorem 13 of n° 15, 
(ii) are 


H?(G) =U-(G)N L(G). 


For all r € R, H2(P) is complete and in fact convergence in mean implies 
compact convergence in P [Chap. VIII, n° 4, (iv)], or in G if r € Z. For integer r, 
these spaces play an important role in the theory of automorphic functions as well 
as for harmonic analysis on the group G. The same is true for non-integer r — for 
some time now, experts have been devoting much energy to the study of modular 
forms of weight 1/2, 1 and 3/2 -, but then we must accept to replace G by its 
universal covering, a topic that will not be broached in this chapter.°° Nonetheless, 
for a real r, it should be observed that the formula 


L(g") f(z) = J(gs 2)" (92), 
though it is not well-defined for all g € G, does not raise any difficulties for g € By: 


it suffices to set 


(16.1) L,(b) f(z) =t"f(P22+u) for at ae t>0 


defining as usual t” = exp(rlogt). We thereby get a representation of By on each 
H®?(P). This remark will prove useful later, so will the expression 


i3 =f \r-(8)PPab 


of the norm on H?2(P). This equality is obvious since dg = dbdk for g = bk. 
We start by recalling and completing the properties of functions of H}(P). 


(i) Integrable holomorphic functions on the half-plane. Let us make the weaker 
assumption 


(16.2) / y"? |f(z)|dm(z) < +00 for all T>0. 
Im(z)<T 


56 See a detailed construction in R. Godement, Introduction 4 la théorie des groupes 
de Lie (Springer, 2003), §2, n° 7. An element of the covering is an ordered pair 
g = (g,w) where the continuous and real-valued function w(z) satisfies 


J(9;2) = |J(g; 2)lelw(z)], 


and so is one of the uniform branches in P of the “function” Arg(cz +d), up to 
the factor 27. Multiplication is defined by 


(9',w") (g",w") =(g,w) where g=g'g", w(z) =u" (gz) +w"(z). 


G cannot be realized as a matrix group. On forms of weight 1/2, see the somewhat 
opaque Chap. IV of Neal Koblitz, Introduction to Elliptic Curves and Modular 
Forms (2d ed., Springer, 1993). 
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As was seen in Chap. VIII by comparing it to the integral of f over a horizontal 
strip of finite width, the series )> f(z +) and hence all its derived series converge 
normally in all compact subsets of P. The function « ++ f(x + iy) and all its 
derivatives are therefore integrable and tend to 0 at infinity. The Fourier transform 
of f(x + iy) is continuous and O(t~%) for all N > 0, and thus is integrable. The 
Fourier inversion formula can, therefore, be applied. But integrating along the con- 
tour of a horizontal rectangle whose vertical sides tend to infinity, we had shown 
that the integral 


(16.3) f@®= i f(z)e(—tz)dz = e(—ity) | f(a + iy)e(—ta)dax 
Im(z)=y 

does not depend on y. Then the inversion formula becomes 

(16.4) f(z)= / f (t)e(tz)dt 


and shows that f is a complex Fourier transform. Finally, the Poisson summation 
formula applied to x +> f(a + iy) shows that 


(16.5) So flz+n) =o f(nje(nz), 
both series converging normally in every horizontal strip of finite width. All this 


remains necessarily true if f € H;(P). 
Moreover, by (3), 


|f(0)|exp(-2anty) < fife +in)iae, 


whence 


ie Fo exp(—2nty)y" "dy < [, 


y’?? f(2)| dm(z) < +00. 


If T < +00, the left hand side is finite if and only ifk > lie. ifr > 2, a 
condition which is thus necessary for the existence of functions f # 0 satisfying (2). 
If T’ = +00, the integral in y diverges for t < 0, so that convergence also supposes 
that f(t) =0 for all t < 0 since f is continuous. Hence finally, 


+coo | 
(16.6) f(z)= / f(t)e(tz)dt for all f C Hi(P). 
) 

For r > 2, as we have already checked that the functions (z — W)~? are in Hi(P) 
for all w € P provided p > r, it finally follows that H;(P) 4 {0} if and only if 
r>2. 

To go further than these results of Chap. VIII, let us start by finding upper 
bounds for f(t) and f(z) for f € H4(P). Since y"/? f(z) is integrable [with respect 
to dm(z), as usual in this context] and e(tz) is bounded by 1 in P for t > 0, one 
can compute 


[v2 seet@am(e) = fy? ary f fle + iy)exp(—2nty)e(-tx)de = 


= (4nt)'~"/?P (r/2—1) f(t). 
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The left hand side being bounded above by”” ||f;||1 = fy? |f(2)|dm(z), we suc- 
cessively deduce that 


Bo? | FO] <er(r) IMF» 


(16.7) [F< f |F@ete)| ae < er) fel, f #7? exp(—2nty)a, 


(16.8) uy? |f(2)| < car) Wells 


with constants 


- r/2-1 5 
ea(r) = pone car) = (2n)-"/2L(r/2)ea(r).. 


Since y’/? f(z) is bounded on P, one can also deduce that 
(16.9) Hi(P) CHe(P) forall p>1. 


For integer r, |fr(g)| = |J(g;i)~" f(z)| = y"/|f(2)|, so that f, is bounded on G. 
Similarly, 


(16.8’) IIFrlloo S e2(7) Fella 
and, instead of (9), 


(16.9’) Hi(G) CH2R(G) for 1<p<-+oo. 


(8’) also shows that, in H/(G), convergence in mean implies uniform convergence 
on G and not only on every compact subset. 


Fig. 3. 


Exercise 1 [direct proof of (8)]. (a) Applying Cauchy’s formula to the contour 
ABCD above and making t tend to +00, show that 


°’ T use this convenient notation despite the fact that the function f,(g) is not 
well-defined for non-integer r. 
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iia f(w) f(w) 
ae - —_ W— a 7 — W— ge 


for 0 < vu’ < Im(z) < v”, the integrals being convergent. Let c’ and c”’ be the values, 
independent of v’ and v’, of these two integrals. (b) Let B be the horizontal strip 
vy <Im(w) < v¥. Show that, for Im(z) < vf < vy, 


/ Pw) mw) < = NFolla 


w- 2 vy —Im(z) 


Computing the left hand side in terms of ¢”, vi’ and vy, show that c’’ = 0 and 
that 


(16.10) Qni f(z) -|/ ae LO) aw for 0< vu < Im(z). 


(c) For 0 < T < Im(z), compute the integral 


| BEE a) 


v<T We 
and deduce that y"/* f(z) is bounded on P. 
For p = 2, (9) proves the convergence of the integral 
Wel = fv te) ame) = fy tary | fle + iy) ae 
for all f € Hi(P). However, for given y, f(a+iy) is bounded by (8), and integrable, 


and so is in L?(R)M L1(R). Its Fourier transform, namely f (t)e(ity), is, therefore, 
in L?(R). Plancherel’s formula then shows that 


[vr s@ramte) = [fy exp(—anty |f(0]? atary = 
= HE y” | exp(—47y) [fe] erata’y 7 


= (4m)! P(r — 1) f [fof erat. 


This calculation is justified by LF since it only concerns positive functions. It follows 
that, for all f € Hz(P), 


(16.11) pe 


which improves (7) but without characterizing the functions f (see theorem 15 
below). Finally, the identity 4(f|g) = ... shows that, more generally, for f,g € 
H;(P), 


fp) at < +00, 


Cree i. vy" F(2)g(@am(2) = 


(16.12) where c3(r) = (4r)' "I'(r — 1). 
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(ii) The spaces H® of the unit disc. We saw at the end of the previous n°, whose 
notation we keep, that associating the function 


faQO=IE) (6O = sO: IO Ssie 77) 


on the unit disc D to every function f on P gives 


y/7 f(z) = (1-16?) fod), 


up to a constant factor. So, for p < +00,the space H?(P) is transformed into the 
space H?(D) of holomorphic functions on |¢| < 1 satisfying 


(16.13) fle ~|¢P)"? o(¢ )|’ dm(¢) < +00 


where dm(C), proportional to (1 — |¢|?)~*dédn, is an invariant measure on D. In 
polar coordinates, this becomes 


16.13") / | pO 8) dead < 4.00, 


where integration with respect to p is extended to (0,1). As 


1 2 1pr—-2 
fa-8)} odo < +00 => pr > 2, 
0 


for all r > 2/p, integer or not, H?(D) contains every function y(¢) bounded on 
the unit disc. Coming back to the half-plane, this means that, for r > 2/p, the 
space H?(P) contains every function f(z) such that (z + 1)" f(z) is bounded, for 
example the function (z +7%)~", a result that can be easily verified directly. Hence 
H?(P) A {0} for all real exponent r > 2/p. 


This argument supposes p < +00. For p = +00, condition (13) is replaced by 


(16.13") sup (|1 — ¢|?)"”" |p(@)| < +00, 


whence H?°(D) # {0} for all r > 0, including r = 0. 

Conversely, let us suppose that H?(D) 4 {0}. Though the operator L,-(g) is not 
well-defined for all g € G’ if r is not an integer, for all yp € H2(D) and allt € R, 
the function y[e(t)¢] is clearly still in H?(D). This gives a representation of the 
group of rotations on H?(D), which is obviously continuous and isometric. It can 
be decomposed into one-dimensional representations,*® so that there are functions 
yp #0 in H2(D) such that 


y [e(t)¢] = e(t)" p(¢) 


for some m € Z. Since vy is holomorphic, this requires y(¢) = c¢™ and m > 0. 
Substituting in (13’) gives 


°8 This was shown (for every compact group) at the end of n° 29, (ii) of Chap. XI 
for unitary representations, but the proof and the result are easily generaliz- 
able to all representations (H,U) on an arbitrary Banach space: use operators 
f U(k)x(k)dk, where x varies in the set of characters of K. 
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Toe 
ftps 
As m > 0, by (13”), the converse reduces to the condition r > 2/p if p < +00, and 
to r > 0 if p= +00. As a result: 


Theorem 14. The space H?(P) is non-trivial if and only ifr > 2/p when p < +00, 
orr > 0 when p= +00. 


We had already obtained this result for p = 1 using Fourier transforms, but less 
easily. 

When r is an integer, one can use the representation L, of G’ on H?(D). For the 
matrix k’ = diag(e(t),e(—t)) which corresponds to k(t) € K and for y(¢) = am¢™, 


Lr (R') 9() = J (RG) (RG) = e(t)" Ff [e(2e)¢] = et)" (6) 


or, coming back to H2(P), L,[k(t)~*]f(z) = e(t)?"*" f(z). So setting n = 2m +r 
and Xxn[k(t)] = e(nt) in accordance to (15.13), 


(16.14) Lr (k") f=xn(k)f with n>r, n=r (mod2). 


For all n of the form indicated, (14) has a unique solution, up to a constant factor. 
By (15.41’) for y(¢) = ¢™, it is the function 


zZ—7t a(n—r) 1 1 
(16.15) f2=e@r+y™ (5) = (z—4)2OM (2 $i 3), 


As was seen above, the functions f € H,(G) represented on the unit disc by a 
monomial in ¢ are characterized by a character x of K such that 


f(ka)=x(k)f(@), te xe f= f. 


For all f € H,(G), the function y * f trivially satisfies this condition and on D 
obviously corresponds to one of the terms of the power series of fp(¢), or to 0. 
Hence 


f(x) = Sox * f(z) for all f €H,(G), 


the series converging normally on every compact subset of G as does the power 
series of fp on every compact subset of D. 

This argument also shows that the subspace of solutions of x * f = f has 
dimension 0 or 1 for all y. For vy = xr, it is obviously generated by the function 
a, which on D correspond to the function 1. It is, therefore, characterized by the 
relations 


DrEH(G), xXr*@r=Wr, @r(e)=1. 
For f € H,(G), let us then consider the function 
xt fle) = ff f(kea)xc a. 


It trivially satisfies the first two conditions imposed on @w,, and so is proportional 
to it. For x = e, the previous integral equals f(e) since f has weight r. Thus it 
equals f(e)@,(x) for all x. Replacing f by L(y~')f € H-(G) gives 


i: f(yka)x(k)dk = f(y)ar(x) for all f € H,(G). 
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As k varies, the images kx(0) of the point 0 € D under ka describe a circle centered 
at 0 and passing through x(0), and the yka(0) describe the image of this circle 
under y, thus a circle whose interior contains the point y(0). So the previous formula 
essentially means that integrating fp(¢) along this circle gives the value of fp at 
y(0). As for Baron Cauchy, he did not only integrate over circles... 

Thus, all this is simply a clever re-writing of standard and easy properties of 
power series. The group G is too simple for the power of these methods to be gauged. 
If G is only semisimple, with a maximal compact (non-commutative) subgroup K 
so that G/K has a complex analytic structure, then one should refer to Harish- 
Chandra’s marvelous and far less than semi-simple articles on Discrete series for 
semi-simple Lie groups which can be found in his Collected Works (Springer). 


(iii) A theorem of Paley-Wiener type for H2(P). Fourier transforms have en- 
abled us to associate a function f (¢) with complex Fourier transform f to each 
f € Hz(P) In section (i) we obtained properties, but not any characterization, 
of these functions. One can also associate Fourier transforms i (£) to functions 
f € H2(P), and this time characterize them fully, but proofs are more difficult. 


Theorem 15. For all f € H2(P), r > 1, there is a measurable function f(t) zero 
fort <0, satisfying 


(16.16) | \fof era < +00 


and for which 
+oo | 

(16.17) f(z) =} f(t)e(tz)dt. 
0 


— sc it 
(16.18) [v1@a@am(e) = amr —y [ fngperrat 
for all f,g © H2(P). 


Conversely, integral (17) converges for all solutions of (16) and belongs to 


H?(P). 
Since ff y"*| f(z)|?dxdy < +00, there is a null set N(f) such that 


[ite +inP ae <+oo for y¢ N(f). 
The function fy(x) = f(a + %y) then has a Fourier transform 


(16.19) f(t) = Li.m.? f(a + iy)e(—ta)da 


—n 


in L?(R), and Plancherel’s formula shows that 


: o 4y|2 
fite+impac= [Awa 
Let us show that there is function f(t) such that, for all y ¢ N(f), 


(16.20) fy(t) = exp(—2rty) f(t) for almost all te R. 


To this end, we set 
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and show that 


(16.21) lim [fan (t) = fom(t)] = 


for alla,b >O andteER. 
Lemma 1. For all functions f € H?(P), p < +00, 


ia f(a +iy) =0 


uniformly on every compact subset of Ri. 


Indeed let us suppose that z stays in a strip B:0<a<y<b< +oo and 
consider a strip B’:0 <a’ < y <b’ < +o0, with a’ < a and b < b’. Let K’ c B’ 
be the set of ¢ = € +77 such that 


Seca ly- a Sey 


and set K = K' B. There is [Chap. VIII, n° 4, (iv)] a constant cp such that 


OP Sep fi |f(G)lPam(G) for all ze K 


K’ 


and all holomorphic functions on B’. Letting the same horizontal translation z +> 
z+uact on f, K and Kk’ replaces the left hand side with f(z + u) and on the right 
hand side, K’ with K’ + u. Thus 


iret iy) Som f FQ dm(0) for a<y<b. 


K'4+a 
As the integral f | f(¢)|Pdm(¢) extended to all of the strip B’ converges, the right 
hand side clearly tends to 0 as |z|increases indefinitely, qed. 

Relation (21) readily follows from the lemma. Integration of f(z)e(—tz) along 
the rectangle bounded by horizontals a and 6 and verticals n and —n shows that 
fan(t) — fo.n(t) is indeed the difference between the integrals of f(z)e(—tz) along 
the verticals n and —n of strip B. As |e(—tz)| = exp(2znty) is bounded on B, 
the product f(z)e(—tz) converges uniformly to 0 as z tends to infinity in strip B, 
whence (21). 

We can now return to the proof of (20). For all y ¢ N(f), relation (19) can also 
be written 


fy (t)= Lim.” fun(t) exp(—2nty) ; 


for a,b ¢ N(f), the difference fa(t) — f(t) is thus the limit in mean of a sequence 


converging everywhere to 0, and so (Riesz-Fischer) fa(t) = fo(t) for almost all t, 
whence (20). 
This being so, (Plancherel and LF) 


if = ff v*@Pavdy = f yPay [| FeO[ exe(—anty)at = 


(16.22) - / 


fol at fy exp(—4rty)d*y. 
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In theory, these are upper integrals. The convergence of the integral in y requiring 
t > 0, it follows that 


f(t) =0 for almost all t < 0 


[and no longer for all t < 0 as was the case in Hz(P)], then that 


(4m)! "P(r — 1) il |Fo| erat = IIJ1B < +00, 


which proves (16), as well as (18) for f = g, and so for arbitrary f and g by 
Af, 9) = ++ 

As f(t}t"-”/? and t H+ e(tz)t~¢-"/? are square integrable over t > 0 for 
Im(z) > 0 (and r > 1), integral (17) obviously converges. By (20), the Fourier 
transform f,(t) = f(t)|e(tz)| is integrable for y ¢ N(f); since x +4 f(x + iy) is 
continuous, the Fourier inversion formula 


Fle+ iy) = f fultjette)ae= f f(eyetteat 


is justified for y ¢ N(f). But the second integral is a continuous function of z 
(dominated convergence). So (17) holds for all z. 

Conversely, starting from a solution of (16), the same arguments show that 
integral (17) converges, represents a holomorphic function and (22) that f € H?2(P), 
qed. 

Multiplying the functions f (t) by t0-")/2 | the following statement follows: for 
all r > 1, the map associating the function 


ZR> [oP eetenat 


to all y € L?(R,) is an isomorphism from L?(R+) onto H2(P). The typical Paley- 
Wiener theorem, which they in effect proved and which is given by all authors, 
corresponds to the case r = 1, excluded here because H?(P) = {0} for r < 1. It 
characterizes complex Fourier transforms of y € L?(R1). The reader will easily find 
the statement and proof based on previous arguments. 


(iv) The kernel function of H?(P). The (complete) Hilbert space H?(P) is non- 
trivial if and only if r > 1. On it, convergence in norm implies simple (and even 
compact) convergence. So, for all z € P, there is (Chap. XI, n° 19, theorem 31) a 
function K. € H;(P) such that 


(16.23) f(z) =(f|Kz) for all f © H2(P). 
It can be determined in several ways. 
Setting 
K,(z,w) = K.(w) for Im(z)>0, Im(w)>0, 
gives 
(16.24) Fla) = fv" Key) flw)dm(w) (w= w+ iv). 


where K,. is the kernel function of H?(P), a notion invented by Stefan Bergman in 
the 1920s- 30s and applicable to any Hilbert space whose elements are holomorphic 
functions, as well to many other situations (elliptic partial differential equations). 
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For given z, K;(z, w) is clearly a holomorphic function of W. Relation (23) shows 
that (K.|Kw) = Kz(w). Thus 
(16.25) K,(z,w) = (Kw, Kz) = Kr(w,z), 
so that K;(z,w) is holomorphic at z for given w. 


First method. We now consider the representation L, of the subgroup B+ on 
H?(P) defined at the start of this n°. It is obviously unitary. By (15.6), replacing 
f by L,(b~*) for b € By gives 


J(b, 2)" f (bz) = (Lr(b)* f|Kz) = (f|Lr(b) Kz) . 
This can also be written 
(f|Koz) = f(bz) = JG;2)" (f|Lr(b) Kz) - 


Comparing with (23) for bz and taking into account the fact that K is determined 
by (23), it follows that 


16.26) Koz = J(b, 2)’ Lr(b) Koz, 

which is equivalent to 

16.26’) K,(bz, bw) = J(b, z)"J(b, w) K,(z,w) 

or to K,(tz+€,tw+&) =t~-"K,(z,w) for all t > 0 and all € € R. Setting H(z, w) = 


z—W)’K,(z,w) gives a holomorphic function in z and antiholomorphic in w such 
that H(tz+€,tw+€) = H(z,w). So 


A(z,w) = H{i,(w—2)/y] , 


an expression which, for given w, must be holomorphic in z. However, H(i, w) is 
antiholomorphic in w. Hence its differential with respect to w is proportional to 
dw. For given w, the differential of the function z +> Hi, (w — x)/y] with respect 
to z is thus proportional to d[(w* — x)/y] and not to dz if H(t, w) is not a constant. 
Thus the only possibility is that H(i,w), and so H(z,w), is constant. Therefore, 


(16.27) K,(z,w) = cr (z-W)" 
and 
(16.28) eis i (esas Fai Alay: 


By astrange coincidence, function (27) appears among those which, in Chap. VIII, 
n° 12, (iv), eq. (12.23), have proved useful to show that H}(P) 4 {0} if and only 
if r > 2: indeed we had shown that the function (z — @)~? is in Hi(P) if and only 
if both Re(p) > 1 and Re(p) > r/2, hence for p= r > 2. For r > 2, function (27) 
is, therefore, in H}(P) and not only in H?2(P), and as H;(P) C H2(P), it follows 
that, for r > 2, relation (23) applies to all f € Hi(P) — and to all f € H2(P) if 
r>1. 

Second method. Show that, for all z € P, there is a function g. € H2(P) such 
that 


(16.29) G-( = t', ‘e(tz) 
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and that f(z) = f{ f(w)gz(w)dm(w), up to a constant factor. Calculating g.(w), 
show that 


(16.30) K,(z,w) = —=(z-@)7 


Third method. When r is an integer, it is also possible to work in G. To do 
this, we replace H?(P) with H?(G) = H,(G)N L?(G). As the space of holomorphic 
functions of weight r, it is closed in L?(G) and y +> y(z) is a continuous linear 
functional for all « € G. Hence there is a unique function w, € H2(G) such that 


pe) = (ylwr) 
and so, replacing y by L(a~')y, 


(16.31) pl) = (E(2)lwr) = (IL (@)wr) . 
For y = wr, (31) becomes 
wr (x) = (wr|L(@)wr) , 


which shows that w, is a function of positive type on G [Chap. XI, n° 30, (iii)], 
hence satisfies w,(a~') = w,(x). Then, by (31), 


g(x) = [ ewisr (@-ty)dy = io (y~'2) dy, 
in other words, 
(16.32) p=Pr*ur. 


This is relation (24) for G. Once again everything is easier on G than on P or D. 

Furthermore, (27) follows readily from this. Indeed, the function w, = W, sat- 
isfies x, * Wr = w,. Relations (14) and (15) of section (ii) then readily show that, 
the corresponding function on the unit disc is the monomial of degree 0 in ¢, hence 


that, for a constant cr, wr corresponds to cr(z +%)~". 


(v) The holomorphic discrete series of irreducible representations of SL2(R). 


Theorem 16 (V. Bargmann). For all reals r > 1, the representation Ly of B+ 
on H2(P) is irreducible. For integer r, the unitary representation Ly of SL2(R) on 
H?(P) is irreducible and square integrable. 


Let us first suppose that r is an integer. To prove irreducibility, it suffices to 
show that any operator A in H2(G) commuting with left translations L(x) is a 
scalar. It then amounts to showing that the function w’ = Aw, is proportional to 
wr, since relation Aw, = Aw, implies AL(x)w, = AL(x)wr. Thus the result follows 
since linear combinations of L(x)w, are everywhere dense in H2(G). However, as 
wr is Hermitian symmetric, it satisfies 


(16.33) wr (kgk") = Xr(k)wr(9)xr (F') 


and so L(k~!)w, = xr(k)w,. As A commutes with all L,(k), the function w’ also 
satisfies (33). However, for all characters y of K, the subspace of f € H,(G) satis- 
fying x * f = f has dimension 0 or 1. Hence Aw, = Aw;. 

Finally, the representation is square integrable since its coefficient (w,|Lr(x)wr) = 
w,(a) is in L?(G) (Chap. XI, n° 32). 
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In the general case, this argument no longer holds in this precise form. But it 
is still possible to use operators 


(16.34) Ly (67) fa) =tf (Pz+u) for b= € ey , £>0 


and to show that any operator A in H?(P) commuting with all L,(b) is a scalar. 
As in the computation of the kernel function K,(z,w), by (23), we associate the 
function 


Ka(z,w) = (AKw|Kz) = AKw(z) 


to A . It is holomorphic in z and antiholomorphic in w. By (26) and equality 
AL,(b) = L,(b)A, the function K,4 satisfies the same functional equation (26’) as 
K,. Now, we have shown that it determines K, up to a constant factor. Hence 
(AK.|K.) = \(Kw|K-), and so A = A1 since the functions K, generate H?(P), 
qed. 

Exercise 2. Associating the function 


(A) = ATP? FO), 
to each f € H2(P), show that the question reduces to L?(R+) and operators 
(16.35) p(A) + e(ud)t 7! (t77A) . 


Conclude that all representations L, of By on H?(P) are mutually equivalent. 
Exercise 8. Associate the function 


{OV SIGi) FO; eer? Fay. 


to f © H2(P). Show that f, = Jy where J; is an isomorphism from L?(R4+) onto 
a minimal closed invariant subspace of the left invariant representation of B+, up 
to a constant factor. The irreducible representations of By on these subspaces are 
equivalent to representation (35). (The group By has other irreducible representa- 
tions: those obtained by replacing these invariant subspaces with their conjugates, 
as well the one-dimensional representations 
beo t?, oeER. 

These are the only ones). 

Exercise 4. Fourier transforms on H?(P) enable one to reduce operators of B 
to formulas (35) in L?(R%.). For integer r, the group G acts on H?(P), hence on 
the functions y(A). Calculate the effect of the matrix w on these functions. This 
will give an integral operator whose kernel can be written using Bessel functions; 
see n° 22, (iii). 


Bargmann did not bother with arguments from functional analysis. By making 
the Lie algebra of G act on an arbitrary unitary representation of G, he obtained 
symmetric operators not defined everywhere which he assumed to be essentially 
self-adjoint (correct but not proved) and deduced the classification of irreducible 
representations of G using simple algebraic calculations that will be given in n° 29. 
He thus calculated explicitly the coefficients using differential equations whose solu- 
tions are special functions whose properties were well-known (integrable formulas, 
orthogonality relations, asymptotic behaviour, etc.). From this he deduced that 
some representations can be realized on spaces of holomorphic or antiholomorphic 
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functions®® and that, with respect to a privileged orthonormal basis (the functions 
¢” in the unit disc interpretation), the coefficients of the representation are square 
integrable and pairwise orthogonal. Some of his methods will be presented in § 8. 
They converted Harish-Chandra to group theory; his generalizations are extraor- 
dinary, and Bargmann’s physicist arguments were soon justified. But his method 
remains the only one®° and, moreover, applies to the universal covering of G. In 
n° 32 du Chap. XI on Bargmann’s orthogonality methods, the steps to be followed 
in this case were clearly set out. 

Exercise 5. Let (H,U) be a unitary representation of G. For any character y of 
K, let H(x) be the subspace of a € H such that U(k)a = y(k)a, so that H is the 
Hilbert direct sum of these H(x). Assume there exists y and a € H(x) such that 
H(x) is one-dimensional and that the elements U(ax)a generate H. Show that the 
representation is irreducible and that the function w(x) = (U(x)ala) satisfies the 
functional equation 


w(e) f wolcky)x(R)ak = ola )u(y). 
(Any irreducible representation of G is of the previous type: n° 30). 


(vi) Solutions of the equation f * wy = f. We showed at the end of section (iv) 
that, for integer r, f = f *w, for any function f € H2(G) = L?(G)NH,(G). 
Actually, the map P : f -> f *w, on the space L?(G), is the orthogonal projection 
operator onto the closed subspace H2(G). Indeed if f’ is the projection of f on 
H?(G), then 


f * wr(x) = [ Howey = (f|L(a)wr) = (f'|L(a)wr) = f(a) 


since f’ satisfies (32), whence the result. The fact that P is a projection can also 
be deduced from the Bargmann orthogonality relations. 

Hence formula (32) characterizes functions f € L?(G)NH,(G), but we will need 
a similar result for functions about which all that we know is that they are locally 
integrable and for which, the expression f * w, is well-defined. For example, if I” is 
a discrete subgroup of G, this is the case for all f € L?(I’\G) for any p. 


Lemma 2 (integer r > 2). Let f be a locally integrable function such that f xw, = 
g is defined. Then®' g €H,(G), and f is in H,(G) if and only if f *w, = f. 


°° A function f(z) is antiholomorphic if its conjugate is holomorphic. Replacing 
the functions f € H?(P) by their conjugates and making obvious changes to the 
definition of L,(g) leads to another series of representations on which theorem 
16 can be applied. “Antiholomorphic” representations are not equivalent to 
“holomorphic” ones. 

V. Bargmann, Irreducible unitary representations of the Lorentz group (Annals 
of Math., 48, 1947, pp. 568-640) ; see for example Serge Lang, SL2(R) (Addison- 
Wesley, 1975 or Springer, 1985). In his Panorama des mathématiques contempo- 
raines where Dieudonné proposes a list of “initiators” for each major field, in 
particular non-commutative harmonic analysis, I cannot find Bargmann, neither 
Gelfand and yet, at about the same time, together with Neumark he studied 
the group SL2(C) and influenced HC as much as Bargmann. HC told me one 
day: semi-simple groups are my friends, I know them all individually. This is not 
reflected in his articles, where he always from the onset only considers the most 
general case, but... 

As f is not assumed to be continuous, this relation means that g is equal almost 
everywhere to the function g’ € H,(G). The same remark applies to the relation 


fxr =f. 


60 


61 
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We first recall [Chap. XI, n° 25, (iv)] that, f * w, is defined if and only if, for 
all p € L(G), 


(16.36) / Ip(ay) f (a)wr(y) |dedy < +00. 


The function 
(16.37) f*wr(2) = / f (wy~*) wr(y)dy = ‘| Ff (y)wr (ya) dy 


is then defined almost everywhere and is locally integrable. Associativity of the 
convolution product will be used in the cases covered by lemma 3 below. 

Let us first suppose that f € H,(G) and that f * w, exists (which is obviously 
not always true, for example if f = f, where f(z) = e*). For every character x 
of K, the function y * f € H,(G) is either trivial or corresponds to a monomial on 
D.So 


X*(f er) = (x* f) wr =x f 


and thus x « (f *w, — f) = 0 for all x, which implies the result because 0 is the 
only locally integrable function g such that x * g = 0 for all y (exercise !). 

Before proving the converse, let us show that there are functions yp on L(G) 
such that 


(16.38) Q*Wr = Wr * P= Wr. 


To this end we consider the unitary representation (H2(G), L) of G. As 


peur(e) = | o(y)ur(y~t2)dy = | (oy) (L(y) wel L(at)ere) = (L(y)wel L(at)we) 


the proof reduces to choosing y in such a way that L(y)w, = w,. However, there 
exist Yn € L(G) such that L(y~n)w, converges to w, (take a Dirac sequence). Then 


Wr = Xr * Wr = L(xr)wr = lim L(xr * Yn) wr ; 


but the function L(x, * Gn)wr = w’ € H2(G) trivially satisfies x, *w’ = w’, and so 
is proportional to w,, qed. 

Let us now come back to a function f for which f *w, = g exists. Replacing 
f with 0 outside larger and larger compact sets leads to a sequence of functions 
fn € L'(G) converging everywhere to f. Then (dominated convergence) 


(16.39) f *wr(x) = lim fn * w(x) = limgn(x) for almost all x. 
Since L’ * L? C L? [Chap. XI, n° 25, (iv), theorem 35], gn = fn * wr is in L?, and 
SO gn * Wr E H?(G) as was seen above. However the convolution w, *w, exists since 
w, € L?(G). Hence 

Qn * Wr = (fin * Wr) * Wr = fin * (We * Wr) = fn * Wr = Gn 
by lemma 3 for d\(x) = fn(x)dx. Thus the functions gn are in H2(G) and, being 
dominated by the locally integrable function 


p(x) = f |f (cy7*) |.|wr(y)|dy, 
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converge almost everywhere to g. Let us then choose some y € L(G) satisfying (38). 
By lemma 3, gn * & = fn * Wr = gn. Since y is continuous with compact support, 
lim gn * G(x) = g * G(x) as well (dominated convergence ), whence g * y(x) = g(x) 
almost everywhere. Furthermore, 


lon * B(x) — 9 * B(2)| < f |gn(u) — 9(u)|-1@0*2) lay. 


If M is the support of ¢, the integral is extended to the compact set eM~', hence 
to a fixed compact set M’ if x stays in a compact subset A of G. As the functions 
lgn(y) — g(y)|, dominated by 2p(y), tend to 0, the right hand side tends uniformly 
to 0 on A. The function g = g * @ is therefore the limit of functions gn € Hr(G) 
with respect to compact convergence, whence g € H,(G), qed. 

The associativity formula used for the convolution product remains to be jus- 
tified: 


Lemma 3. Let and v dbe two measures such that ux v exists. Then 
(16.40) A* (wx) = (Ax) eV and wx (Vv *A) = (xv) *A 
for all measures with compact support. 


We showed in Chap. XI, n° 25, (ii) that, generally speaking, the previous relation 
only involves absolute values of the measures considered and that, for positive 
measures, it holds provided 


i /| p(xyz)dd(x)du(y)dv(z) < +00 


for all p € L4(G). If \ has compact support, then clearly f p(ayz)dA(x) = q(yz) 
with some gq € L+(G). So if 4 * v exists, the upper integral 


J eevoarce) [f antuyar(e) 


is by definition of xv finite. Thus Lebesgue-Fubini follows by (40). The same proof 
holds if A * ys exists and if vy has compact support. 


The reader is likely to think that, as formula (28) holds for all gn, it also holds 
for the limit g making the holomorphy of g obvious. To justify this argument, it is 
at the very least necessary to prove a general result enabling us to pass to the limit 
in Cauchy’s formula for the functions gn: 


Lemma 4. Let U be an open subset of C and (gn) a sequence of holomorphic 
functions on U. Suppose that lim gn(z) = g(z) exists almost everywhere and that 
there is a locally integrable function q > 0 on U such that |gn(z)| < q(z) for all 
z €U. Then the sequence (gn) converges uniformly in every compact set and g is 
equal almost everywhere to a holomorphic function. 


Compact convergence being a local property, using a translation, it is possible 
to work in a disc D : |z| < r contained in U. Choosing numbers r’ and r” such that 
r<r'’ <r” assuming the closed disc |z| <r” to be contained in U, for all t z € D 
and all @ € [r’,r], 


gn(2) = / an (oe(#)) (oe(t) — 2)7* ge()at 


Using polar coordinates, we get a formula of type 
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(16.41) 9n(z) = cf gn(w)(1 — z/w) 'dudu (w = u-+ iv) 
where integration is over the annulus M : r’ < |w| <r”. This being so, 


|gn(w) (1 — 2/w)7"| < q(w)|1— 2/w|™ . 


This is an integrable function on M since, almost by definition, g is integrable over 
all compact sets [Chap. XI, n° 5, (ii)]. It is thus possible to pass to the limit in 
relation (41) for the functions gn, proving (41) for g, which is therefore holomorphic. 
The convergence of the functions gn on every compact set is obviously uniform: it 
follows from the usual upper bounds, qed. 

In fact, relation (38) replaces formula (41) and can be applied in far more general 
situations (spherical functions) where no complex analytic structure is available. 


We will also use the fact that wx f € H,(G) for all f € H,(G) and all measures 
with compact support. As H,(G) is left invariant and closed with respect to the 
compact topology, it suffices to show that, on any compact set, for any continuous 
function f on G, the function 


px f(a) = / f (yx) duly) 


is the uniform limit of linear combinations of left translations of f. Now, if x stays 
in a compact set A and if B is the support of u, then the integral is extended to 
the compact set AB~* = C. As f is continuous, for all € > 0, B can be covered by 
finitely many open sets U; such that, for y1, y2 € Ui, 


lf (yi 2) — f (yz'z)| <e fortall ceA 


(uniform continuity). Replacing the U; with pairwise disjoint Borel sets EF; C U; 
and choosing some y; € F; for all 7, we get 


ay LE 2) ~ F (0%) | all) 


for all a € A. The right hand side is < €S> |u|(Ei) < elpu|(G), qed. 


Jef) — OF (etx) ws) 
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§ 6. Modular Functions: The Classical Theory 


In all of this §, 1’ will denote the modular group SL2(Z) = G(Z), unless otherwise 
stated. 


17 — Fundamental Domain, Modular Forms 


(i) Generators of the modular group. The modular group I" contains the transfor- 
mations 


T:ize>z+1, S:zr+-—-1/z 


corresponding to matrices 


(17.1) T= (( 1) : S= € a) =w. 


We start with an essential result: 
Theorem 17. The modular group is generated by matrices S and T. 


This means that any y € I is for the form T?S?T” ... with exponents in Z. As 
S? = —1, powers of S can be omitted, if need be by replacing y with —y. 

The proof if simple. Let a, b,c, d be the entries of 7; we may assume that c > 0. 
We will use induction on c. If c = 0, equality ad = 1 requires a = d = +1, and 74 is 
then a power of T’,, up to sign. When c > 0, 


and we choose n in such a way that 0 < a+ nc < c (Euclidian division with 
remainder). The entries of y’ = T”y then satisfy c’ =c,0<a' <c. As 


0 -1\ fa’ W\ _ (-c —d' 
1 0 cd} \d vb}? 
for 7’ = Sy’ = ST”, the new entry c’ =a’ satisfies 0 < ce’ < c. The induction 


hypothesis then shows that y” is a non-commutative monomial in T and S, up to 
sign. Hence so is ¥, qed. 


Corollary 1. A function f(z) satisfies 
f(z) = Iz)" f(z) for all 7 € GZ) 
for some even integer r if and only if 
(17.2) f2@+)=fl), — f(-1/z) = 2" f(z). 
It goes without saying that for r odd, f is necessarily 0 since —1 € I. 


(ii) Fundamental domain. The second result is about the explicit determination 
of a fundamental domain for I’ acting on the upper half-plane P. This is a classic, 
but no longer so useful exercise (seemingly resolved by Gauss in his secret papers): as 
such precise results are not available for arithmetic subgroups of general semisimple 
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groups, weaker results have to make do, even for classical groups studied by Hermite, 
Minkowski, Siegel, etc. 
The method is the same as that used for the group of the function 6(z) in § 2, 
n° 5. For given z € P, 


17.3) Im(yz) = y/|cz + d\’, 


which enables us to choose y in such a way that Im(yz) is maximal. As integral 
translations T” : z+ z+ leave Im(z) invariant, 7 can be required to also satisfy 
Re(yz)| < 4. But, our choice of 7 implies 


Im(yz) > Im($z) = Im(-1/72) = Im(7z)/I72))’, 


so |yz| > 1 follows. Finally, any z € P can be mapped by I to a closed subset F’ of 
P defined by the inequalities 


1 
(17.4) F:lel<5,  |al21. 


By the way, note that the invariant measure on F is finite as is that of any subset 
of P contained in the strip |z| <M, y>c>0. 

To show that F is really a fundamentally domain for I in a strict sense, it is 
also necessary to check that two points z’, 2’ of F cannot be equivalent mod I if 
one of them is in the interior of F. So suppose that 2” = yz’. The entry c of 
can be assumed to be > 0 (otherwise, replace y by —7y), so can y” > y’ (otherwise 
permute z’ and 2”). By (3), the relation y” > y’ can be written 


cz" | dj” = (ca’ | d)” | ey” <1 


and implies |cy’| < 1. However it is sort of obvious that y’ > sin 7/3 for all 2’ € F. 
Hence c? < 4/3, and soc =0 or 1. 

If c = 0, then z” = z’ +n, which requires z’ and 2” to be located on the 
intersections of the vertical sides of F with a horizontal, y then being z > z+1 
or z+>z-1. 

If c = 1, then |z’ + d| < 1, which requires d € {—1,0,1} as follows from the 
images of F under integral translations. Hence there are again two possible cases. 

If d = 0, (4) shows that |z’| < 1, hence that |z’| = 1, and 


(6 2)-G969). 


Thus 2” = a-— 1/2’. For |z’| = 1, the point —1/z’ being the symmetric of z’ with 
respect to the imaginary axis, for a — 1/z’ to be in F a must be 0 unless z’ and 
z" are the two vertices®? exp(27i/3) = j and exp(mi/3) = —j of “triangle” F, in 


which case a = —1 or +1 may be appropriate according to the case. 
If |d| = 1, for |z’| = 1, |z’ +d] < 1 is possible only if either z’ = j and d= 1, or 
z' =—j and d = —1, as indicated above. 


To summarize: 


Theorem 18. Let F be the subset of the half-plane P defined by inequalities 
(17.5) jz| > 1, jal] <a. 


°°? In my youth, the notation j = exp(27i/3) was standard in France. Freitag and 
Busam use the letter 0, Koblitz the letter w, etc. An international commission 
should be convened. 
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Then P =|J7(F) and two distinct points 2’, 2’ € F can be mapped onto each other 
byay€T only if they are on the boundary of F and symmetric with respect to the 
imaginary azis, i.e. if Im(z’) = Im(z”). For Im(z) > 1, Im(yz) > 1 if and only if 
YET =INB. 


The arguments that have led us to case (b) show a bit more. We will say that 
a point z € P is a fized point of I if there exists y 4 +1 in I’ such that yz = z. 
The proof of the theorem shows that the only fixed points in F' are (a) z =i = Sz, 
(b) z = exp(27i/3) = j = STz = T~'Sz, (c) z = exp(mi/3) = -7 = ST~'z = TSz. 
As the other fixed points can made mapped onto F' by I’, they are the images 
under I’ of the points found. It is even unnecessary to consider both j and —j since 
—j = Tj. Hence if the fixed points are divided into classes mod I’, there are only 
two such classes: that of 7 and that of 7. Moreover, if we do not distinguish between 
matrices y and —y which define the same transformations on P, the stabilizer of 7 in 
I is the subgroup {1, S} of order 2, whereas that of j is the subgroup {1, ST, T~!S} 
of order 3. 

Exercise 1. Let I be a discrete subgroup of a connected locally compact group 
G. An open subset 2 C P will be said to be an open fundamental set for I’ if 
(a) P=U 2, and if (b) there are finitely many matrices y such that yQN 2 4 0. 
Show that I" is generated by the elements yy such that yQ#2 (show that, if I” 
is the subgroup generated by these elements of I’, the union of yQ for y € I’ is 
both open and closed in P). [In generalizations of the theory to semisimple groups, 
only very rarely is it possible to construct genuine fundamental domains F’. Open 
fundamental sets are just as useful. In the case of the modular group, one can 
choose any set defined by inequalities |z| < a, y > T with sufficiently large a and 
sufficiently small T > OJ. 

Exercise 2. Let I’ be a subgroup of finite index n in the modular group I and 
I'y,...,P°' Yn the n cosets mod I” in I’. Show that F’ = UF is a fundamen- 
tal domain for I’. What about the group of the function 6(z)? What about the 
subgroup®? I’(2) of matrices = 1(mod 2)? 


(iii) The classical definition of modular forms. The simplest version of the clas- 
sical theory with respect to [ = G(Z) consists in studying meromorphic functions 
satisfying 


(17.6) f(yz) = F(z)" F(z) 


for a given even integer r. As will be seen, to get results, these functions must not 
increase too rapidly as Im(z) increases indefinitely. 

First note that the poles of f in P can be divided into classes mod I’. By the 
previous theorem, two poles z,z’ with imaginary parts > 1 can belong to the same 
class only if there exists n € Z such that 2’ = z +n. The set of poles contained in 
the fundamental domain F' is a discrete subset of F’, which does not stop them from 
accumulating to infinity. To get rid of this possibility, f must first be required to 
be holomorphic for sufficiently large Im(z), so that f only has finitely many poles 
in F or, equivalently, that the poles of f in P are divided into finitely many classes 
modI’. 

As f(z+1) = f(z), there is then a Fourier series expansion 


(17.7) f(z) = S| ane(nz) , an = $ fla)e(—nz)ax 


°3 For a diagram representing a fundamental domain for I'(2), see Neal Koblitz, 
Introduction to Elliptic Curves and Modular Forms (Springer, 2d. ed., 1993), 
p. 106. 
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valid for sufficiently large Im(z).°* This expansion is also a Laurent series in the 
variable ¢ = e(z) since the map z +—> e(z) transforms every half-plane Im(z) > 
T > 0 into a disc centered at 0. Note that |e(z)| = exp(—27y) tends to 0 as Im(z) 
tends to infinity. This being so, the second condition that f must be required to 
satisfy is that there be only finitely many terms of degree n < 0 in this Laurent 
series, in other words that 


(17.8) f(z) =O (e?7"") » y—+oo 


for some integer p > 0. If k is the smallest integer for which a, #0, f will be said 
to have a pole of order —k at infinity if k < 0, and a zero of order —k at infinity if 
k > 0. For obvious reasons, we set k = Voo(f). 

These two conditions define the modular forms of weight r in the precise sense 
of the word. For r = 0, the expression modular functions is used instead. These are 
analogous to elliptic functions for the group I’. 

If f and g are modular forms of weight p and q, the quotient f/g is clearly a 
form of weight p — q. 

Modular forms holomorphic at infinity are characterized by the fact that they 
are bounded on {Im(z) > T} for sufficiently large T, because a Laurent series 
converging for 0 < |z| < R is a power series if and only its sum is bounded in the 
neighbourhood of 0. Similarly, it would suffice to have an upper bound 


f(z) =O (v*) for large y, 


since were this the case, then 


L: 
lanl < f [s(e)e(-n2)|dz < My" exp(2nny) 


for large y, an expression tending to 0 for all n < 0. For such a function we denote 
the constant term of the Fourier series of f by 


(17.9) f(co) = ao = lim f(z). 


Modular forms that are holomorphic everywhere, including at infinity, are said 
to be entire. They are the analogues of the theta functions of theory of elliptic 
functions. The Fourier series of an entire form f converges and represents f for all 
z € P and, by definition, only involves exponentials e(nz) with exponent n > 0. 
This shows that any entire modular form is bounded on every half-plane Im(z) > 
T > 0. 

For any entire modular form f, the asymptotic behaviour of the function f (iy) 
as y tends to 0 or +00 can be easily obtained. For large y, f(ty) ~ an exp(—27ny) 
where n > 0, which is much better than an asymptotic expansion. As y tends to 
0, relation f(ty) = (—iy)~" f(i/y) shows that f(iy) ~ (—iy)~"an exp(—2an/y). In 
particular, 


(17.10) fliy) = 


This result will enable us [n° 23, (i)] to define the Mellin transform of f(iy), a 
Dirichlet series with a simple functional equation. 


°4 Chap. VII, n° 17: if f(z) = f(z+1) is holomorphic on a strip B : a < Im(z) < 8, 
the Fourier series of f converges normally on every compact subset of B and 
represents f on all of this strip. 
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Entire modular forms such as Hecke Spitzenformen (Spitze = cusp) f(oo) = 0, 
are also called parabolic or cusp forms. As She (2) tends to infinity, f(z) tends to 0. In 
fact there is an upper bound |f(z)| < M(T *¥ on the half-plane Im(z) > T > 0, 
which replaces (10) by 


O (eo 2"5) as y—> +00, 


17.11 iy) = 
( ) Fy) O (ye) as y—>0. 
This result makes the integral defining the Mellin transform of f (iy) convergent for 
all s. 

As was seen in n° 15, each solution of the functional equation f(yz) = 
J(7; 2)" f(z) can be transformed into a function 


(17.12) f-(9) = J(934) " f(gi) 


on G, invariant under left translations g + yg. f is holomorphic if and only if f, 
belongs to the space H,(G) of n° 15, (ii). The set of these functions will be written 
H,(I'\G). We recall the formula 


(17.13) lf) =9"71F@)1- 


(iv) Hisenstein and Poincaré series. The most obvious way to construct left 
T-invariant functions on G is to use the series y" (g) = > y(yg) of Chap. XI, n° 15, 
(vii). If p € L*(G), the series converges almost everywhere, its sum is in L+(I’\G) 
and 


(17.14) is eda = f ela)as. 


As here we only holomorphic functions interest us, we choose y to be a function in 
L'(G)NH,(G) = Hi(G), hence of the form f, where f € H}(P), the space defined 
in n° 16, (i). By n° 15, (ii), y? is then a function of weight r associated to the 
Poincaré series 


(17.15) P,¢(z) = D_ J(732) = UE 


I 


As these are holomorphic functions, convergence in L implies compact convergence 
— details about this point will be given in a more general framework in n° 21, (i) — 
and the next theorem 19 will show that these series are parabolic forms. All this 
supposes that Hz(G) 4 {0}. Sor > 2 and r is even, and thus r > 4. In fact, to have 
any chance of obtaining sums that are not identically zero, one needs to assume 
that r > 12, but every parabolic form will be shown to be a Poincaré series later 
in a more general framework. 

Another type of series is obtained by grouping together the terms in sum (15) 
belonging to a same coset yUs, where Un = UNT = U(Z) is the group of 
translations z > z+ n. This gives 


(17.16) Prlz)= So Iz)" DS f(n t+). 


Uso\P Z 


But for any f € H7(P) there is a Poisson summation formula 
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S\ fletn) => f(nje(nz) 


n>0 


[n° 16, (i)]. Hence, at least formally, we find 


(17.17) Py(2) = S~ f(n)Enn(2) 

with Poincaré-Eisenstein series 

(17.18) Exn(Z) = Ss J(y3 2) "e(n.yz) . 
Uoo\l 


They converge for r > 2 and n > 0 and again define parabolic forms. In fact, like 
Poincaré series, the forms of weight r corresponding to them on G, namely 


Exn(9) = oe Cr.n(9)” ’ 


where summation is mod U. and where 
ern(g) = J(g;t) “e(nz), 
are integrable mod I’. Indeed 


[ glennl)l dg [ (lt lento) = if _lern(o)l = 


Uoo\P Uso \ 


= i) y"? exp(—2rny)dm(z) = 
Uso\P 


= if yy"? exp(—2rny)d*y < +00 
+ 


since n > 0 and r > 2. 

For n = 0, this calculation falls apart but series (18) still converge. The coset 
Usoy of a matrix y € I is the set of all y’ € I whose second row (c d) is the same 
as that of y, because relations ad—bc = a'd—b'c = 1 are equivalent to the existence 
of n such that a’ =a+ne, b' = b+ nd. Hence 


(17.19) Epo(z)= >> (eztd) "= S> Iyz)", 


(c,d)=1 Uoo\L 


a sum extended to all ordered pairs of coprime integers. As r is even, the terms in 


c dand—c —d are equal, and the functions 
1 
(17.20) Er(z) = 5 Bro(z). 


are called reduced Eisenstein series. 
They are related to the first known examples, the full Eisenstein series 


(17.21) G,(z) = > o(cz +d)" = 2¢(r)E,(z) 


of the theory of elliptic functions [n° 17, (iii)], where summation is over all ordered 
pairs (c,d) 4 (0,0). The factor 2¢(r) is obtained by grouping together the terms in 
(ne, nd) with pgcd(c,d) = 1 and n 4 0. Convergence of these series for r = 4,6,... 
is clear [n° 13, (i)], as well as the functional equation. Note that if formula (19) has 
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a clear meaning from a group theoretic viewpoint, this is not the case of (21), a 
major reason for only considering series (20). 

To show that functions (20) or (21) are entire modular forms, the most simple is 
to compute the Fourier series, a task which is anyhow essential. The usual method 
— it of course does not generalize — consists in starting from formula (27.10) 


(17.22) Si (z+n)* a oy Sn? te(nz) 


of Chap. VII, §6, valid for integer s > 2 and Im(z) > 0, and even®® for Re(s) > 1 
[Chap. VIII, (10.9)]. Applying it to the summation over d, we get 


1 7 _ (< 2m)” Pes 
5 Gre) C(r)4 Tr) pe e(cdz). 


The double series converges unconditionally, and even normally in all of the half- 
plane Im(z) = y > T > O: setting q = exp(—27T) < 1, |e(cdz)| < q“@ in this 
half-plane, so that the proof reduces to checking the convergence of the double 
series )> d’~'g°?. First summing over c reduces to the series 37 d"~'q?/(1 — q") 
whose general term is equivalent to d’~!q@ for large d. This gives convergence for 
lq) <1. 

This being settled, it is possible to group together in the series obtained the 
terms for which cd has a given value n. The coefficient of e(nz) then becomes equal 


to 
éa(n) = od" 
d|n 


a sum extended to divisors d > 1 of n. Hence the final result: 


(17.23) 5 Gr (2) or So" Soya (n)e(nz). 


We thus get a power series in e(z), which confirms that G,(z) is an entire modular 
form, for which G;(co) = 2¢(r). 
Since E,(z) = G-(z)/2¢(r), similarly 


Z)= t : (=2ni)" Or—-1(n)e(nz 
(17.24) E,(z) L+ oy Fe) 2, r—i(n)e(nz) . 


This expansion shows that the series E, are entire modular forms for which 
(17.25) E,(oo) = 1. 


There is a simpler form for the factor preceding series (23). Indeed, we have 
shown [Chap. VII, §3, eq. (11.27)] that 


2¢(2p) = (—1)?** (217)’Pbep/ (2p)! 


°° Many authors prove (22) by differentiating term by term the partial fractions 
expansion of the function cotgaz. This is not quicker than using the Poisson 
summation formula and this elegant method is unlikely to provide the result 
for complex s. This is also the reason why, even for an even integer r, I do not 
transform (—277)" into (277)". 
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where bz, is the Bernoulli number of index 2p. As here r is even, this can also be 
written 


¢(r) = I(r) or ’ 
whence 
(17.26) BOTs = Sori (n)e(nz). 


The series obtained has rational coefficients. It was found in the 19th century by 
setting g = e(z), and its computation would have taken Euler half of a quarter of 
an hour if only he had had the idea that 


Ea(z) = 14+ 240 (q+ 9q7 + 28q7 +...) , 

Eo(z) = 1— 504 (q+ 33q7 + 244g° +...) , 

Es(z) = 1+ 480 (q + 129q7 + 21189? +...) , 
E,o(z) = 1— 264 (q + 513q” + 19684q7 +...) , 
Ey2(z) = 14 a (q + 2049q° + 17719897 +...) , 


E.a(z) = 1— 24 (q + 8193q? + 1594774q° +...) . 


Needless to say that the arithmetic properties of these strange coefficients have 
made many people cogitate and continue to make several computers run... 

I will not dwell further on these series, whose study will be again taken up in 
the framework of Fuchsian groups. It should nonetheless be added that the theory 
of elliptic functions leads to a parabolic form of weight 12 defined in a very different 
manner, the function 


(17.27) A(z) = e(2) TJ [1 e(nz)]** = fz)", 


n>1 


where 7(z) = e(z/24) [][1 — e(nz)] is the Dedekind function of n° 3. As 


n(z +1) =exp(ni/12)n(z), n(-1/2) = (2/1)? nl2), 


and as, in Ramanujan’s notation, 


(17.28) A(z) = e(z) [1 — 24e(z) +... = oa T(n)e(nz) 


is clearly a power series in e(z) without a constant term, A is obviously a parabolic 
form of weight 12. 


It was mentioned above that Poincaré series are parabolic because they are 
integrable over ’\G. The next general theorem justifies this point: 


Theorem 19. (a) Every entire modular form of weight r < 0 (resp. r = 0) is 
trivial (resp. constant). 
(b) For any modular form of weight r > 4, 


(17.29) f(z) = f(00) Er(z) + g(2) 
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where g is parabolic. 
(c) A solution of 


fz) =IG2)" Fl), 722, 
holomorphic everywhere on P is a parabolic form only if (resp. if) 
(17.30) fr € L?(I\G) 


for all (resp. for some) p < +00. 
(d) The Fourier coefficients of all entire (resp. cusp) forms of weight r > 2 
satisfy 


(17.31) Qn = O(n") (resp. O(n"’?)) : 


(a) Let f be an entire modular form of weight r. Then, as was seen above, f(z) 
is bounded on the fundamental domain F’. If r = 0, it is bounded on P since it is 
T-invariant. Removing the constant term of its Fourier series, it may be supposed to 
be trivial, in which case, f(z) tends to 0 as Im(z) tends to +oo. From the arguments 
already used in n° 5, it then follows that |f(z)| reaches its maximum at a point 
of F’, and as its maximum in F' is also its maximum in P, the function is constant. 

If r < 0, like f and y’/?, the function |y"/?f(z)| is bounded on F. As it is 
T-invariant, it is bounded on P. Hence f, € H?°(G), a space which reduces to 0 
for r <0 [n° 16, (ii), theorem 14]. 

(b) Obvious because of (25). 

(c) Formula (7) for calculating Fourier coefficients shows that 


lanl < exp2nny) § [fle + iy)| dr. 
For p < +00, to start with it, we deduce (Hélder for [0, 1]) that 
lanl? < exp(2anpy) [fle + iy) de 


So, setting r/2 =k, for all T > 0 


vf(e)[" dm(2) > lanl? ff exp(—2nnpy)y"*"a"y, 


y2T 


For n < 0, the second integral diverges for all p and k. Hence, if the left hand 
side is finite for some p < +00, then a, = 0 for n < 0. For n = 0, convergence 
would suppose pk < 1 and so necessarily k < 1. Thus ao = 0 also holds if r > 2. 
Conversely if f is parabolic, then f(z) = O(exp(—27y)) for large y. This condition 
is more than sufficient to ensure f; € L?(I’\G) for all p and k, including p = +oo. 

Conversely, if f(g) is bounded on G, it is in L?(I’\G) for all p < +00 since 
I\G has finite volume, which reduces the proof to the previous case. 

(d) If f is parabolic, y’/? f(z) is bounded on P and the inequality 


lan| < exp(2rny) ¢ Fle+iy)| de 


shows that 


lan| < My” exp(27ny) 
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for all y. Putting y = 1/n, the expected upper bound follows.®° If f is not parabolic, 
it can be made to be by removing f(co)E,(z). Now, the Fourier coefficients of 
Eisenstein series are proportional to the sums o,-1(n). Such a sum having at most 
n terms < n"~', the result is O(n"), qed. 

Proposition (c) is particularly useful in the case r = 2. It will be seen later that 
there are no non-trivial entire modular forms in this case. But theorem 19 applies 
to far more general groups for which there are no such objections. 


18 — Analogues of Liouville’s Two Theorems 


(i) The Riemann surface of SL2(Z). It was shown above that there are no non-trivial 
entire modular forms of weight r < 0 and that, for r = 0, they reduce to constants. 
This last result resembles Liouville’s first theorem on elliptic functions. There is 
also an analogue for modular forms of Liouville’s second theorem about the zeros 
and poles of elliptic functions. The standard proof which consists of integrating 
df/f along the boundary of the fundamental domain F’ can be found everywhere. 
Since in n° 11, we have reduced Liouville’s theorems to the simpler result that is 
proved in the theory of Riemann surfaces (Chap. X, n° 1, theorem 1), we might as 
well apply the same method here, which remains valid for all generalizations of the 
modular group. 

As a first approximation, the Riemann surface X(I‘) = X attached to I’ is 
the quotient space I’\P. We equip it with the structure of a complex manifold by 
requiring that for any open set 2 C I\P, the “holomorphic” functions on 92 be, 
by definition, the holomorphic I’-invariant functions on the inverse image 1~'(2) 
under the canonical map 7 : P —> X. To show that this gives a Riemann surface, it 
is necessary to exhibit local uniformizers and holomorphic chart changes (Chap. X, 
m1); 

Let a = x(a) be a point of X and I, = I'M Ka the stabilizer of a in G. The 
conformal representation z +> (z — a)/(z — @) of P on the unit disc transforms 
Ka into the rotation group about the origin. As was seen in Chap. XI, n° 15, (vii) 
in a far more general case, there are arbitrarily small open neighbourhoods W of a 
having the following three properties: 


(i) W is stable under Ka, 
(ii) YWHW = YET, 
(iii) the image of W under z+ (z — a)/(z —@) is a disc centered at®” 0. 


For such a W, 7(W) is an open neighbourhood of a in X which can be identified 
with the quotient space [,\W by (i) and (ii). Let us consider a function f(z) on 
W. It can be extended to an invariant function on 7~'(m(W)) = I.W if and only 
if its restriction to W is I,-invariant. Indeed if this condition holds then, setting 
f(z) = f(y~'z) for all z belonging to some yW, the result only depends on the 
choice of y since, by (ii), y is unique mod I... Besides, f is clearly holomorphic on 
I.W if and only if it is on W. “Holomorphic” functions on 7(W) can therefore be 
identified with holomorphic functions on the /4-invariant set W. 

However, the map z +> (z—a)/(z—@) being a conformal representation of W 
on a disc centered at 0, the holomorphic functions on W are the power series in 


°6 This argument, as well the use of Fourier series, is due to Hamburg mathemati- 
cian Erich Hecke, who in the 1930s gave new life to a theory threatened to become 
obsolete by introducing ideas that continue to reveal their importance. 

®7 So W is the set of points of C for which the ratio of distances to points a and @ 
is less than a given number e < 1. The boundary of W is a circle not centered 
at a. 
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(z—a)/(z—@). As was seen after theorem 18, I, = {1,—1} unless a is of the form 
(i) or 7(j). In the first case, the condition of I,-invariance is empty. If a = y(2), 
I, contains two matrices 1 and Sa, up to the factor +1 which acts trivially, where 
Sa = ySy7' satisfies S? = 1. As the only rotation of order 2 about the origin is 
¢ ++ —¢, the holomorphic functions on the J,-invariant set W are the power series 
in [((z—a)/(z—@)]’. Finally if a = 7(j), I’. contains the conjugates of 7 by the three 
matrices 1, ST and T7'S = (ST), up to the factor +1; on the unit disc, these 
are the rotations of angle 0, 27/3 and 47/3, in other words the multiplications by 
cubic roots of unity. The [,-invariant power series are then clearly the power series 
in ((z— a) /(z—a))*. 
We then let n(a) = 1,2 or 3 denote the order of I /{1,—1} and set 


(18.1) Ca(z) = [(z-—a)/(z-@)"™ for ae P. 


These functions are holomorphic and invariant under the corresponding I,. Fur- 
thermore, they clearly have the following property: 


(iv) for z,2’ € P, there exists 7 € Ia such that 2’ = yz if and only if ¢a(z) = 
Ca(z’). 


Hence there is a function ga on m(W) for which 
(18.2) daot =Ca on W. 


By (iv), da is a homeomorphism from 7(W) onto a disc centered at O in C. This 
gives a local topological chart (7(W),qa) of X in the neighbourhood of a. The 
neighbourhood 7(W) of a depends on the choice of W, but, for given a, the functions 
da corresponding to the various possible choices for W are clearly pairwise equal 
on their common domains of definition. 

If “holomorphic” functions on open subsets of X are defined as above, then the 
“holomorphic” functions on 7(W) are precisely the power series in ga. A function 
defined on an arbitrary open subset 2 of X is“holomorphic” if and only if in 
the neighbourhood of any a € 2, it can be holomorphically expressed in a chart 
(7(W), qa) centered at a. 

Chart changes remain to be shown to be holomorphic. This amounts to checking 
that, for all a,a’ € P and neighbourhoods W, W’ of a,a’ satisfying conditions (i), 
(ii) and (iii), Ca is a holomorphic function of ¢,, on WM W’ and conversely. This 
is obvious if a = a’, or if ¢, and ¢q are conformal representations of WM W’ on 
open subsets of C, i.e. are injective on WW’ [Chap. VIII, n° 5, theorem 7]. This 
is always the case if a 4 a’. 

Indeed suppose there are distinct points a; and az in WNW’ such that Ca(a1) = 
¢a(a2). By (iv), there exists y € I’, such that a2 = ya1. It follows that yW’ #W’, 
hence that by (ii) applied to for W’, y € Ty, so that ya’ = a’. A non-trivial elliptic 
matrix having a unique fixed point in P, a = a’, a contradiction. 

In conclusion, the local charts introduced on X = I'\P satisfy the conditions 
set a priori. They turn X into a Riemann surface and, for any open subset 92 of 
X,a function f defined on 92 is holomorphic — this term can now be used for it — 
if and only if f oa is holomorphic on 7~'(Q). This result says precisely that 7 is a 
holomorphic map from P onto X. It generalizes to meromorphic functions: in the 
neighbourhood of a pole a = z(a), such a function f is, by definition, a Laurent 
series in the local uniformizer qa, with finitely many terms of negative degree. 
Hence, in the neighbourhood of a, f om is a Laurent series in [(z — a)(z — a) |" 
and conversely, this property is equivalent to [-invariance. . 

At the same time, denoting by va(f) the order at a of a meromorphic function 
in the neighbourhood of a € X and by va(f) the order at a of a meromorphic 
function f(z) on an open subset of P, 
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Va(f) = val fo m)/n(a) 


for every meromorphic function in the neighbourhood of z(a). It follows that the 
total order of a function f defined and meromorphic on X is 


Yi va(f)= So valfon)/n(a) = 


(18.3) = vi(fom)/2+;(fom)/3+ >) val(fom), 


the latter 5* being extended over all classes mod I" of points a that are not fixed 
points of I’. 

Were X compact, it would show that, like in the theory of elliptic functions, 
Yo va(f)/n(a) = 0 for any meromorphic modular function of weight 0. But X is 
not compact, either because the fundamental domain F' is not so, either, and this 
is a better argument, because of a simple and general result (Poincaré) which will 
be proved later: a discrete subgroup I’ of G such that I'\P is compact does not 
contain any parabolic matrices. 

But the Riemann surface X = I’\P can be compactified by taking into account 
the conditions imposed on the behaviour of modular functions at infinity. Theorem 
18 shows that, for T > 1, the image X(T) in X of the half-plane 6.(T) = {Im(z) > 
T} can be identified with the quotient of G(T) by the subgroup [> = IN B. 
A function f defined on X(T) is, therefore, holomorphic if and only if f o 7 is 
holomorphic on 6..(T), i.e. is a Laurent series in 


18.4) Coo (z) = e(z). 
As e(z) is periodic, there is a function go on X(T) such that 
18.5) doo 0(2) = Cool(2), 


and doo is clearly a homeomorphism from X(T) onto the dis |¢| < exp(—27T) of C 
with its centre removed. On the other hand, all G.(T) clearly satisfy conditions (i), 
ii), (iii) and (iv) provided Ka is replaced by U in (i), Iu by Ix. = I’ N B in (ii) 
theorem 15, (a)], (z — a)/(z — @) by e(z) in (iii) and Ca by Coo in (iv). From the 
above, it then follows that any holomorphic function defined and periodic on 6(T) 
extends to a holomorphic [invariant function on m~'(X(T)). 

We next adjoin a point denoted by oo to X, as in the definition of the Riemann 


sphere or of algebraic functions in Chap. X , and call a set 2 C K=Xu {oo} open 
if 2N X is open in X, and in case oo € M2, if it contains X(T) for sufficiently large 
T.. Setting X(T) = X(T)U {oo} and defining qoo (oo) = 0, ordered pairs (X(T), qoo) 
are clearly charts of X in the neighbourhood of the point oo. 

Let us show that these charts (which assume T > 1) are mutually holomorphi- 
cally compatible (obvious) and with (a(W), qa) introduced above on X. The proof 
amounts to showing that ¢..(z) and Ca(z) are injective on WM G(T). Were Ca 
not so, by (iv) there would exist y € Ia such that yGo(T)#G0(T), which by 
(ii) applied to Gx.(T), would imply 7 € Isc, which is absurd. The function Co. is 
also injective on WM G(T’). Otherwise, by (iv) applied to G(T) there would 
exist y € [sno such that yW #W, hence such that y € Iu, equally absurd. Thus 
the charts at infinity are compatible with those already known on X. 

As modular functions (of order 0) on P have been required to be meromor- 
phic everywhere and for 7 sufficiently large, to have an expansion of the form 
Van Gne(z)” on G(T), it is now clear that the modular functions (r = 0) are 


the meromorphic functions X(L) and that, considered as such, their order veo(f) 
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at the point oo of X is just their order as Laurent series in e(z). To get the total 
order of the corresponding meromorphic function on xX , it therefore suffices to add 
Voo(f) to sum (3). 

To finish, we still need to show that the Riemann surface x (L) is compact. To 


this end, let a, be a sequence of points of Xx 1f infinitely many of its terms are 
equal to oo, extracting a convergent subsequence from it is not difficult. Hence we 
may suppose that an = 7(an) for some an € P, determined mod I and which can 
be assumed to be in F. If supIm(an) < +00, the points an belong to a compact 
subset of P. Thus Bolzano-Weierstrass holds. If supIm(a,) = +00, extracting a 
subsequence, we may assume that limIm(an) = +00, hence that, for all T > 0, 
Gn € Goo(T) for large n. This says precisely that liman = co, qed. 


(ii) Zeros and poles. We are now ready to prove the classic result: 


Theorem 20. For any meromorphic modular form f of weight r, 


vWf) = Sd) va(f)/n(a) = 


amodr 


(18.6) = vi(f)/2 + 0;(f)/3 + voolf)+ J) valf) =r/12. 


a#i,j,00 


If r = 0, as shown above, v(f) is the sum of the orders of zeros and poles of 


the meromorphic function which corresponds to f on X (1). Hence v(f) = 0 in this 
case (Chap. X, n° 1, theorem 1). 

In the general case, if f and g are modular forms of arbitrary weights, then it 
is clear that v(fg) = v(f) + v(g). If f and g have equal weight, f/g has weight 
0, whence v(f) = v(g). As a result, v(f) is an additive function of r € Z, and so 
v(f) = cr with a constant c which needs to be determined. To do this, we choose 
the form A(z) = n(z)** of weight 12 already introduced in n° 16. It is holomorphic 
everywhere, including infinity where v..(A) = 1, and has no zeros in P. Hence 
v(A) = 1. As a result, 12c = 1, c = 1/12 and the proof of (6) follows using the 
Dedekind function. 

A less miraculous argument consists in considering a meromorphic differential 
form w on X. The map 7: P +> X being holomorphic, w has a meromorphic 
inverse image wom = f(z)dz. As wo7 is I-invariant, the meromorphic function f 
satisfies the functional equation f(yz) = J(y;z)~? f(z) for modular forms of weight 
2. Let a = a(a) be a point of X and (x(W),qa) a local chart of X at a. Then 
w = ha.dqa on 7(W) where ha, meromorphic on 7(W), is a Laurent series in qa. If 
it starts with a term q®, by definition, the number p is the order va(w) of w at a 
(Chap. X, n° 1). Then by (2), wom = (haom)d(qa07) = (ha om)dGa on W. By (1), 
dea = (z—a)"~1eq(z)dz where ca(a) # 0. On the other hand, hao7 is a Laurent 
series in ¢, starting with a term of degree va(w) in Ca, hence of degree n(a)va(w) 
in z—a. Thus 


(18.7) va(f) = n(a)va(w) + n(a) — 1. 


Let us now work in the neighbourhood of the point oo of X. The same arguments 
and the relation 


flz)dz = wont = (he oT) dag = 277 (ha 0 1) A(z) dz 


show that, for sufficient large T, the Fourier series of F on G(T) starts with a 
term of degree Voo(w) + 1, whence 
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(18.8) Voo(f) = Voo(w) + 1. 


This also shows that at infinity f satisfies the definition of modular forms of weight 
2. 
nd (8) and taking into account the fact that 


As a result, adding relations (7) a 
= 3, we see that 


n(a) = 1 unless n(i) = 2 and n(j) 


W(f) = voolf) + >) va(f)/n(a) = 


(18.9) = $5 vow) +14 (1-1/2) + (1-1/3). 


By definition of the genus g of X, (Chap. X, n° 1), > va(w) = 2g — 2. So the right 
hand side of (9) equals 


2g -—2+14+1/24+2/3=29+1/6. 


But since f is a form of weight 2, the left hand side equals 2c, where c if the constant 
such that v(f) = cr for all forms of weight r. Hence c = g + 1/12. In conclusion, 


theorem 20 says that the compact Riemann surface XCD) has genus 0. 

Proving directly that g = 0 is a different matter. If a compact Riemann surface 
is known to be of genus 0 if and only if it is homeomorphic to the Riemann sphere 
— which was not proved in Chap. X —, then we only need to check that X falls in 
this case. However, X is obtained from the fundamental domain F' by identifying 
two points of F that are mapped to each other by I’. This amounts to identifying 
points on the boundary of F’ having the same y coordinate and to adjoining the 
point at infinity to the result. Very gifted for selling their wares, topologists will 
explain that, to understand what is going on, you need to cut the paper of infinite 
length on which you have drawn F along the boundary of F’, then to glue together 
its vertical sides as well as the two circular arcs |z| = 1 located on both sides of the 
Oy axis. You thus get a tube of infinite length,®* and you then contract to a unique 
point its vaguely circular “boundary” with which it “ends” at infinity. The result 
can then “clearly” be deformed into a sphere. Elementary, my dear Watson. 

A less fanciful proof consists in showing that there is a meromorphic function J 
on X witha simple pole at infinity and holomorphic elsewhere. For such a function, 
the map J from X to the Riemann sphere C is bijective because, for any mero- 
morphic function f on a compact Riemann surface and any z € C, the number 
of solution of the equation f(¢) = z does not depend on z (Chap. X, n° 1). As 
JI:X — Cis continuous, J is a homeomorphism — and even an isomorphism of 
analytic manifolds since a holomorphic and injective map is a conformal represen- 
tation. Whence g = 0. But J needs to be constructed. We will do this later, again 
unfortunately, using the miraculous Dedekind function: J = E3/A. 

Exercise 1. If w is a meromorphic differential form on Xx , we showed that wom = 
f(z)dz where f is a meromorphic modular form of weight 2. Prove the converse. 

Exercise 2. Keeping the notation of the previous exercise, express the fact that 
the sum of residues of w is zero (Chap. X, theorem 1) using f and show that there 
is an analogous formula for f of arbitrary weight r. 


°8 Transforming P into the unit disc by z +> (z—i)/(z+i), F becomes a curvilinear 
triangle bounded by three circular arcs orthogonal to |z| = 1 and with one vertex 
on the latter. This vertex plays the role of the point at infinity of F’, which 
simplifies the topological argument. 
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(iii) Construction of modular forms using A(z) and Eisenstein series. Theorem 
20 has immediate consequences. 

If f is an entire modular form of weight r > 2, the integers v;(f), etc. of 
theorem 20 must all be > 0. For r = 2, this is clearly impossible, and so there 
are no non-trivial entire modular forms®’ of weight 2. Besides, such a form would 
correspond to a differential form holomorphic everywhere on X, i.e. on the Riemann 
sphere. For small values of r, it is easy to convince oneself that the only possibilities 
are given by the following table, where vao(f) is included in the sum S> va(f): 


r vi(f) uy(f) D7 va(f) 


4 
(18.10) : 
0 
4 


PrROrRO 


1 
0 
2 
1 
2 


ooCcoco 


However, if f and g are modular forms of equal weight such that va(f) = va(g) 
for all a, including a = oo, the quotient f/g is a I -invariant function without any 
zeros or poles, hence a constant. So for r = 4,6,8,10 or 14, there is at most one 
entire modular form of weight r, up to a constant factor. The existence of such a 
form is obvious: for all r > 4, the reduced Eisenstein series E,(z) = 5>1/(cz +d)” 
satisfies the functional equation, is holomorphic everywhere on P and tends to 1 as 
z tends to the vertex oo of the fundamental domain F’. 

New strange identities follow from this. If f and g are entire modular forms of 
weight r and s, f(z)g(z) is clearly entire of weight r + s. Hence 


(18.11) E4(z)? = Exs(z) ’ E4(z)E6(z) = E\0(z) s E6(z)Es(z) = Ey4(z) 


since, in each case, the left hand sides take value 1 at infinity. 

Let us now suppose r = 12. First we have the case where va(f) = 0 for all a 
except UVoo(f) = 1. The series Ej2(z) falls in this case and is the only one in it, 
up to a constant factor. As it takes value 1 at infinity, f — f(0o)Fi2 is parabolic 
for every entire modular form of weight 12. However, A(z) = n(z)*4 is a parabolic 
form of weight 12 which is zero only at infinity. Hence, if f is another parabolic 
form of weight 12, the function f/A is holomorphic everywhere on P, [’-invariant 
and holomorphic at infinity since it is of the form (aig + ...)/q(1 +...) where 
q = e(z), hence regular. As a result, A is the only parabolic form of weight 12, up 
to a constant factor. So the only entire modular forms of weight 12 are the linear 
combinations of Fy2 and A. 

For r = 12, E1(z)?, Ea(z)Eg(z) and E¢(z)? are necessarily equal to E12(z) + cA(z), 
with a constant c depending on the case. It can be calculated using the first few 
terms of their Fourier series which were computed in the previous n°. For example, 


E¢(z)” 
E\2(z) 
As A(z) = e(z)+..., 


[1 — 504e(z) + ...J? =1— 1008e(z) +... , 
1+ 65520e(z)/691+.... 


(18.12) Ey2(z) — Es(z)? =cA(z) with c = 2°3°77/691. 


The same type of argument shows that E4(z)? — E¢(z) is proportional to A(z). 
As E4(z) = 1+ 240e(z) +... and E6(z) = 1-504e(z)+..., 


°° Naturally, there are meromorphic forms of weight 2, for example E44 /A. 
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Ea(z)? — Ee(z)? = (3.240 + 2.504)e(z) +... = 1728e(z) +..., 
whence 
(18.13) Ea(z)® — Eo(z)? = 1728A(z). 


For a form f of weight r > 14, we use A to reduce to forms of weight < 14. 
To start with, f = f(co)E, +h where h is parabolic; So the quotient h/A is an 
entire modular form of weight r— 12. Thus f = f(co)E, +gA where g is of weight 
r—12.If r—12 < 14, g is proportional to E,_-12 if r # 24 and, if r = 24, a linear 
combination of E42 and A. Redoing the operation if r > 26 leads to the next result: 


Theorem 21. Every entire modular form of weight r is a linear combination of 
functions E,, Ep—12A, By gah: oe 


The sequence ends when it gives nonexistent forms of weight < 2. These functions 
are linearly independent because their power series in gq = e(z) start respectively 
with the term 1,q,q7, etc. So they form a basis for the finite dimensional vector 
space of entire modular forms of weight r. 

Exercise 3. Every entire modular form is a polynomial in E4 and Ee. 


(iv) Application to elliptic functions. Let L be a period lattice in C and let us 
consider the Weierstrass function y = gz(u). It satisfies the differential equation 


(18.13) y? = 4y° — py -¢q 
where 
(18.14) p=g2(L)=605 > 1/w*, g=ga(L)=140) w®. 


We are going to show that, for any set of given coefficients, there is a corresponding 
lattice L, except for one restriction. 
To this end we choose a basis wi, w2 of L such that 


we/wi = ze P. 


So 
(18.15’) p=coE4(z) with co = 120€(4)/wt = 404 /3w1, 
(18.15”) q=c3E¢6(z) with cz = 280¢(6)/wt = 82° /27w% . 


Incidentally, notice the useful detail that c3 = c}. We saw above that 
1728A(z) = Ea(z)® — Eo(2)? = (w/w)? [(3/4)°p* — (27/8)?@?] = 
= (w/w) (p — 27q")3°/2° , 


whence 
(18.16) p> — 27q? = (20/1) A(z). 
As A(z) £0 for all z € P, we need to assume 

p> —27q° #0. 


This relation says that the equation 42? — px — q = 0 does not have double roots. 
This is an exceptional case where the Weierstrass differential equation can be solved 
using elementary algebraic functions. 
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So let us assume this condition holds. We need to show the existence of z € P 
and w satisfying (15’) and (15”). However, the ratio 


(18.17) J(z) = 1728p /(p> — 274") = Ea(z)?/A(z) 


does not depend on w}. If we can find some z for which it takes the value required by 
those fixed for p and q, then we can find some w} satisfying (16), which determines 
it up to a 12th root of unity. As 


p° = E4(z)*(p* — 27q7)/1728A(z) = (x /w1)'?2°3 3 Ea(z)? = 
= [4n* F(z) /3w4]? ; 


relation (15’) then holds modulo a cubic root of unit, which can be removed by 
multiplying it by w:. As (15’) holds, the relation 


E4(z)° [p® — 27q”) = 1728p A(z) = p®[Ea(z)? — E6(z)”] 
shows that 
274° E4(z)* = p’ E6(z)” = c3.Ea(z)? Ee(z)? ; 


If Ea(z) # 0, it follows that q? = c3E¢(z)? giving (15”) up to sign, an ambiguity 
which can once again be removed by multiplying wi by a 12th root of unity, which 
solves the problem in this case. If E4(z) = 0 and so p = 0, it is easier. 

Finally, the proof reduces to showing that there always exists z € P where ratio 
(17) has a given value. Now, this ratio, the modular function J(z) = Es(z)?/A(z) 
that Charles Hermite hoped to encounter on arrival in Heaven, is strictly [’-invariant 
and holomorphic everywhere on P. Its Fourier series at infinity is 


fi+..]/fe(z)+...J=e(z)'+..., 


where as usual the degrees of the omitted terms are greater than those of the term 
explicitly written. In fact 


J(z) = e(—z) + 744 + 196884e(z) + 21493760e(2z) +... . 


As a result, like E4(z), J(z) is a modular function (r = 0) for which v.o(J) = —1, 
and va(J) > 0 elsewhere. So is the function f(z) = J(z)—c for allc € C. For r =0, 
by theorem 20, this implies the existence of some a € P such that va(f) > 0, hence 
of a root of J(z) =c, qed. 

The number z is obviously not unique since yz is equally suitable for all y € I’, 
but thanks to theorem 18, it is the only solution when c # J(i), J(j). 

When c = J(t), vi(f) = 2 because only then does 


vi(f)/2 + 0;(f)/3 + >— val f) = —Vo0(f) = 1 


have integer solutions. Hence in this case, z is again the unique solution mod I’. As 
E6(t) =0 and E,4(i) 4 0, it follows that p 4 0 and q = 0 in this case. 

When c = J(j), the results are the same: v;(f) = 3, p = 0 and q # 0, and there 
is a unique solution mod I’. 

Coming back to differential equation (13) with p> — 27q7 & 0, the previous 
results can be applied. Choose a solution z of (15) then w € C satisfying (14). The 
squares and cubes of formulas (3) then hold. If there is an added factor —1 in (3”), 
it can be made to vanish by multiplying w by an arbitrary 6th root of —1, which is 
then chosen to make sure (15’) holds. It remains to observe that when z is replaced 
by (az + b)/(cz +d), this replaces z = w’/w by (aw + bw’)/(cw + dw’), which is 
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equivalent to taking the basis (aw + bw, cw + dw’) of L instead of (w,w’) and does 
not change L. Therefore, the lattice L only depends on the class of zmod I’. To 
summarize: 


Theorem 22. Let p and q be two complex numbers such that p? —27q? & 0. There 
is a unique lattice L such that p = go(L), q = g3(L). 


To conclude, let us also state the next result: 


Theorem 23. Every modular function is a rational function of J(z). The func- 
tion J is an isomorphism from the Riemann surface X (I’) onto the Riemann sphere 


“nA 


C. 


To see this, we again consider the relation > va(f) = 0 which holds for every 
modular form of weight 0, where va(f) is an integer obtained by dividing the usual 
order va(f) by 2 ifa = y(t), by 3 if a = y(J), and by 1 otherwise. The function 


[[v@ - sor” 


where the product is extended to all a (including oo) where va(f) #4 0, has exactly 
the same zeros and poles as f, including at infinity, and with the same multiplicities. 
So it is proportional to f. The second proposition is obvious since, taking the 
quotient and setting J(co) = co gives a bijective analytic map, hence a conformal 
representation, qed. 

All this is only the beginning of the abc of a theory that, at the end of the 19th 
century, gave rise to a great body of work related to the algebra of number fields, 
Galois theory, the geometry of algebraic curves of genus 1, the “division” of elliptic 
functions, etc. After almost a half-century of hibernation, it again expanded after 
the war under the impulse of Shimura, Taniyama, Weil, etc. who naturally had 
methods and ideas at their disposal that were unknown or not well understood by 
Felix Klein’s contemporaries. ’° 


0 For the former period, see for example Felix Klein and Robert Fricke, Die Ellip- 
tischen Funktionen und ihre Anwendungen (Teubner, 1916); for the relatively 
recent period, Goro Shimura, Introduction to the Arithmetic Theory of Auto- 
morphic Functions (Princeton UP, 1971), Serge Lang, Introduction to Modular 
Forms (Springer, 1976), Anthony W. Knapp, Elliptic Curves (Princeton UP, 
1992). 


394 XII — The Garden of Modular Delights 


§ 7. Fuchsian Groups 


19 — Generalities on Automorphic Forms 


(i) Two lemmas on discrete subgroups. If I is a discrete subgroup of G, z € P is 
said to be a fired elliptic point of I if the stabilizer I’, of z in I’ is not contained in 
{1,1}. As I, is the intersection of I with the compact subgroup Kz, it is a finite 
group. The set of these fixed point is a discrete subset of P (but not of C: they may 
accumulate on R). Indeed as I’ acts properly on P, any z € P has a neighbourhood 
W such that the relation YW #W implies yz = z [Chap. XI, n° 15, (vii)]. Hence 
I. C Tz for all 2’ € W, and as matrices 1 and —1 are the only k € K, leaving 
points other than z fixed, it follows that z’ is not a fixed point of I if z’ 4 z, whence 
the result. 

The parabolic fixed points of I are equally important. To define them, we first 
observe that the elements b € B are characterized by the fact that 


lim bz =o. 


Z=00 


On the other hand, for a matrix g ¢ B, limgz = a/c € R. We are thus led to make 
G act on a set vaguely analogous to the Riemann sphere, namely 


P = PURU {co} = PUP,(R) 


where P;(R) = RU {oo} is the projective line. By definition, we set goo = a/c if 
c #0 and goo = ~ if c = 0. As, for all € € P,(R), there exists x € G such that 
€ = x(00), the relation g(€) = € is equivalent to g~‘xg(oo) = oo. The stabilizers Ge 
of the points of P;(IR) are therefore the Borel subgroups of G. 

€ € P;(R) is then said to be a parabolic fixed point of I if Ig = [1 1G¢ contains 
parabolic matrices # +1. For example, parabolic fixed points of SL2(Z), or of a 
congruence group, are those of QU {oo}, because the only fixed point of a parabolic 
matrix is (a — d)/2c. 

The reader will probably expect there to be also hyperbolic fixed points of I’. 
They certainly exist but will not occur in this chapter and present very complex 
problems when, for some extraordinary reason, they are considered. 

We will need some simple results for groups having parabolic fixed points. They 
are almost trivial in the case of G(Z). 


Lemma 1. Let I’ be a discrete subgroup of G such that [NU F {e}. 
a) For all compact sets M C P, there exists T < +00 such that 


IM Cc {Im(z) < T}. 
b) For sufficiently large T, 
Im(z) >T & Im(yz) >T 
if and only ifyE TAB=Ix. 
Let us first show that 


™ G and more generally GL2(R) are made to act on P;(R) by considering G as a 
group of linear transformations on R?. This group acts on the set of 1-dimensional 
vector subspaces, which by definition is P;(R), and can be identified with RU {oo} 
by associating its slope t, in the elementary sense of the term, to each line y = tx 
with initial point the origin [Chap. IX, § 4, n° 11, (iii)]. 
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(19.1) inf |c(y)| > 0. 


y¢éTNB 


Any matrix y such that c 4 0 can indeed be written as 


by (15.22). Since U/'NU is compact, leaving c invariant, u’ and uw’ may be assumed 
to stay in a fixed compact subset of R. If there exist y, such that c, # 0 tend to 
0, the corresponding u;, and uy can be assumed to converge to limits u’ and wu”. 
Then (Yn) converges to x(u’)a(u”), and so yn € U for large n since I’ is discrete, a 
contradiction which proves (1). 

This being settled, the relation |cz + d|? = (ca +d)? + cy? = y/Im(yz) shows 
that, for c £ 0, Im(z) Im(yz) < 1/c* < m where m > 0 is a constant. Thus propo- 
sition b) of the lemma follows. If z stays in a compact set M, so that inf Im(z) > 0, 
then sup Im(yz) < +00, proving proposition a), ged. 

These results shows that if '\P is compact, I only contains elliptic of hyperbolic 
matrices. Replacing I’ with a conjugate, it suffices to show that [MU = {e}. 
This follows from proposition a) of the lemma since, by assumption, there exists a 
compact set M Cc P such that I.M = P. (Alternate proof: show that, for all y € I, 
the set of conjugates gyg~' is closed in G and deduce the result). 

The next lemma gives more details about the structure of the subgroup of y € I" 
such that c(y) = 0: for these matrices, d(y) = +1. 


Lemma 2. Let I be a discrete subgroup of G having co as a parabolic fixed point. 
There exists a unique w > 0 such that Ux. = U(wZ). The subgroup Px. = IN B is 
then one of the three following groups: 


(19.2) + ¢ 7) (case where —1eET), 
(19.2’) € i) (case where —1¢T), 


(19.2”) & ony. lexceaheie--= be) 


Let x(w),w > 0 be the generator of [NU = Us. If the first row of y € Isc is 
(a b), then y"x(w)y~” = 2(a?"w), whence a? = 1. 

If —1 € I, for example in the case of G(Z), then +y € Uso and we are in 
case (2). 

If —1 ¢ I, then either [5,5 = Uso, or else 5, # Uso. The first case is (2’). In 
the second case, I" contains matrices of the form 


(19.3) le 2) 


with b £ 0. As 7° = 2(—2b) € PNU, 2b = kw for some integer k. But Dx also 
contains matrices 


(19.4) + © Fa - &; (p nee 3) 
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If k is even, then one may choose p = k/2, whence —1 € I’, but this case is excluded. 
So b = kw/2 with k odd for all matrices (3) in I’. Relation (4) then shows that Isc 
contains the matrix 


(19.5) Yoo = e =) 


Its square is the matrix x(w), and so y = y2? for all y € Uo. For y ¢ PNU, hence 
of form (3), b = (2p + 1)w/2; thus y = 7221. As a result, yo. generates I',,, which 
finishes the proof. 

This lemma shows that, for solutions of 


f(yz) = J 2)"F (2), 


there are three possible cases for the action of Ix. 
If —1 eI, then f(z +w) = f(z) and r must be even. 
If —1 ¢ I’, then no restrictions apply to r, and f(z+w) = f(z) again; but if oo 
is irregular [case (2”)], the function f must also satisfy f(z +w/2) = (—1)"f(z). 
Ezercise 1. Let I be the subgroup of y € G(Z) satisfying 


b=c, a=d=(-1)’mod4. 


Show that oo is an irregular parabolic fixed point of I’. 

Lemmas 1 and 2 concern the point at infinity. There are obviously similar 
result for any other parabolic fixed point € = x(0oo): apply the lemmas to the group 
x ‘Ta =I". Setting z = xz’ and 


(19.6) 6x(T) = {Im(z) > T} 


for all T > 0, lemma 1 applied to I’ shows that, for all compact sets M C P, 
x 'Ta.MN6.(T) = @ for large T. Hence, choosing M’ = «~'M and setting 


(19.7) 6.(T) = 2#6.(T), 
for any compact set M C G, 
(19.8) I.MnG6<¢(T)=@ for large T. 


Sets like in (7) are horocycles centered at™ €. Finding its shape is easy. If x € B, 
then € = co and g transforms (6) into another horocycle centered at oo. If « ¢ B, 
we set x = uwb. Since b transforms 6.(T) into a similar set, it may be assumed 
that 2 = uw where u maps 0 onto € since w(oo) = 0. As Im(—1/z) = y/|z|, the 
set wG.0(T) is defined by a relation y/|z|? > T. It is thus a tangent disc to the real 
axis at the origin. Making it undergo the translation z > z + € gives a horocycle 
centered at € represented by the figure below. 

To work in G, we replace horocycles by their inverse images in G. These are 
Siegel domains in G. Those corresponding to horocycles centered at oo are the sets 


(19.9) G(T): a(g)>T >0 
where we have set a(g) = Im(gi) as in n° 15. Obviously 
UG(T) = G(T) 


for all T. Siegel domains corresponding to horocycles centered at € = x(oo) are 
translations 6(T) of the previous sets. 


” The notation G(€) will also be used to denote an unspecified horocycle centered 
at €. 
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b 
Ls S,(T) 
b 
. S(T) * 
= 
WwW 
WwW 
Uu 


fig. 4. 


(ii) Generalities on automorphic forms. We saw that in the case of the group 
G(Z) to obtain theorems about holomorphic or even simply meromorphic solutions 
of the functional equation 


(19.10) frz) = Jz)" fF), 


restrictions must be imposed on their behaviour in the neighbourhood of the point 
oo, the only parabolic fixed point of G(Z), up to equivalence. A general discrete 
group I’ has parabolic fixed points if and only if it contains unipotent matrices. 
It may have none, for example if the quotient I’\P is compact, for there is then a 
compact set M C P such that I'M = P, which contradicts (8). There is then no 
point in distinguishing “entire” forms from “cusp” ones. 

The question arises in the other case, but as already seen in n° 5, the point oo is 
not the only one that matters for the group of the function 6(z). For every parabolic 
fixed point € = x(0o), it is also necessary to consider its stabilizer subgroup I¢ = 
eBae + AT in I, and Ug = eUx~'NT which plays the same role for € as Ux does 
for oo. If € is such a point, so are 7 = o€ for allo € I’ and 


Un =o", In =olzo'. 


Two such fixed points are said to be equivalent. 
The structure of the subgroups I¢ is easy to work out. For the group 


=2 Pc, 
co is a parabolic fixed point. Besides, it is clear that 
haar a Up = 2,2 * 
with the obvious notation. Lemma 2 applies to I’ and gives the result. In all cases, 
Ug = U(wZ)a~* 


for some w > 0, which is obvious, the three cases of lemma 2 being possible for Iz. 
Lemma 1 applied to I’ shows that if G(T) is a sufficiently small Siegel domain 
in G, then 
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yeO(T) #xG(T) — VE Le. 
Hence the canonical map 
I¢\xG6(T) — I\G 


is injective for large T’. 
For any left I’-invariant function y on G, U¢-invariance implies y(xug) = y(xg) 
for all u € U(wZ). If y = f, for a solution of 


fz) = I)" FE), 
then by (15.6), the function y(ag) = L(#~*)y(g) corresponds to the function 
L,(a~")f :2-—+ I(x; z)7 f (az). 


Thus there is a basic principle: studying the behaviour of f(z) in the neighbourhood 
of € = x(00) is equivalent to studying that of J(x;z)~"f(xz) in the neighbourhood 
of x. 

If f(z) is holomorphic on a horocycle centered at € = x(oo), then there is an 
expansion 


(19.11) J(x;z)"f(az) = So ane(z/w)” 


involving all n € Z and valid for large Im(z), and even for all z if f is holomorphic 
on P. f will be said to be holomorphic (resp. parabolic) at € if an = 0 for all n < 0 
(resp. n < 0). 

Expansion (11) only takes Ug-invariance into account. I:-invariance does not 
impose any further conditions except if the fixed point € is irregular. Indeed, the 
translation z ++ z+ w/2 multiplies function (11) by (—1)”. Hence a, # 0 only if 
n =rmod2. In particular, if r is odd and € irregular, any holomorphic form in & 
is parabolic in €. 

Denoting by 


e,(g,t) = J(g;i) “e(tz), z=gi 


the function of weight r which corresponds on G to the exponential e(tz) for all 
t € R, expansion (12) becomes 


(19.11’) fr(ag) = S> aner(g,n/w) for g € e6(T) 


and sufficiently large T. The J factors have disappeared. 

This expansion only depends on the class of the point €, because if € is replaced 
by 7 = o€ with o € I, the function f,-(xg), replaced by f,(oxg), remains invariant 
and the expansions with respect to € and 7 are mapped to each other by the 
translation g +— og. 

We also need to know the effect of a change of matrix x on (11) or (11’) at 
€ = x(00). The only possibility is replacing x with x’ = xb where b € B, which 
replaces U(wZ) by b-'U(wZ)b. Simple calculations then show that w is multiplied 
by a factor > 0 and the coefficients a, by non-trivial factors only depending on b. 
It follows that a relation such as an # 0 is independent of the choice of x. Saying 
that the function f is holomorphic or parabolic at € has an absolute meaning. 

Besides, the matrix x such that € = x(oo) being only defined modulo B, one 
may assume that x = uw if € 4 oo, which removes all arbitrariness. Then 


-1_ (0-1 
: =(1 2) 
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and «~'z = —1/(z—€), J(a@7';a7'z) = (z—€)~" for all z. So, replacing z by «~+z 


in (11) gives 


(z— 6)" F(z) = Do ane [-1/w(z- 8)” . 
If f is holomorphic at €, it is convenient to set 


f(€) = a0. 
Hence, if ao 4 0, 


(19.13) flax fOe-g)" 


as z tends to € in such a way that e[—1/w(z — €)] tends to 0. For this the variable 
—1/(z—&) = 2~'z only needs to tend to infinity without leaving a strip 


(19.14) a<u<b, v>T>0 


of finite width. A set of this type is said to be a parabolic cusp at infinity in P, and 
its image under the matrix x a a parabolic cusp at the vertex €. Similarly, we define 
parabolic cusps with vertex € in G: these are the inverse images of the parabolic 
cusps with vertex € in P under g +> gi . They are deduced from the parabolic 
cusps at oo by applying the translation g +— xg. Hence a parabolic cusp in G, with 
vertex € = xoo, is defined by inequalities of the form 


(19.14’) |Rea(xg)| <<c<+oo, Ima(sg) >T>0. 


It is obvious that, in P or in G, the invariant measure on any parabolic cusp is 
finite. 

As the images under «x of the verticals a and 6 are circles orthogonal to the real 
axis at €, while x transforms the horocycle y > T into a horocycle centered at &, 
a parabolic cusp at € is a subset of a horocycle centered at € contained between 
two circular arcs orthogonal to the real axis at €. See the figure at the end of 
section (i). Relation (13) is exact on every parabolic cusp at €, but any other type 
of convergence to € gives rise to uncontrollable variations of f(z). 


For r > 0, automorphic forms holomorphic (meromorphic) or parabolic at a 
given fixed point € can be characterized as in theorem 19, (c). More generally, let 
us consider a solution f of the usual functional equation defined on an invariant 
open subset of P, and suppose it is defined and holomorphic on a horocycle centered 
at € = x(co). This gives expansions (11) and (11’). As a(g) = Im(gz) tends to +o0, 
the following possibilities may arise, where G(T) denotes a sufficiently small Siegel 
domain: 


) f-(xg) = Ola(g)%] for some N > 0; 

) f(%) is holomorphic at €; 

) f(z) is parabolic at €; 

) fr(xg) is bounded on G(T) ; 

) |fr(g)|? is integrable mod I~ over xG(T) for all p < +00; 

) there exists p < +00 such that |f;(xg)|? is integrable mod I over G(T). 
To find the logical connections between these statements, we may assume that 
x = e, hence work in the half-plane Im(z) > T using the relation 


lf(gl=9"? |F(| 


and the formula 
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a= $ fle + iyje(—nz/w)dx = exp(2any/w) § f(x + iy)e(—ne/w)ar, 


where integration is over R/wZ. 
To start with, 


(1) = (2) forall reZ, 
because then 
lan| < My%~"”? exp(2any/w) . 


This expression tends to 0 at infinity for all n < 0. 
On the other hand, 


19.15) (3) <=> (4) => (5) ifr>0. 


If f is parabolic, then f(z) = Olexp(—27y/w)], whence (4). Conversely, (4) implies 
ao| < My~"/?, whence (3). (4) implies (5) because a bounded function belongs to 
all L? when integration is over a set of finite measure. 

Finally, 


19.15’) (3) <=> (4) <> (5) <> (6) ifr>2. 


As (5) => (6), by (15), it suffices to show that (6) => (3). The proof of theorem 
19, (c), based on the inequality 


9.16) ff WPA Pdm(e)> Ian? fexp(—2nnpy/a)y ay, 
hea y>T 


applies without any changes since the integral on the right hand side always diverges 
if n < 0. For n = 0, convergence of this integral assumes that r < 2/p < 2. So the 
result follows if r > 2. 

The p = 2 case will be needed later. The integral with respect to ao then 
converges if and only if r < 1. Hence, 


(19.15”) (3) = (4) <— (6) ifr>1, p=2. 
Exercise 2. Suppose that I’ = U(Z) and set 
H?(I\G) =Hr(G)N L?(L\G). 


Under what condition is this space non-trivial? Find properties of the spaces 
H?(I'\P) similar to those of H?(P) of n° 16. 

These calculations enable us to generalize all the definitions made about the 
modular group to I’. In the precise sense of the word, an automorphic form of 
weight r for I, must satisfying the following conditions: 


(FA 1): f is meromorphic on P; 

(FA 2): f(yz) = J(7;2)' f(z) for all y ET; 

(FA 3): the zeros and poles of f in P are divided into finitely many classes 
mod I’; 

(FA 4): in the neighbourhood of a parabolic fixed point of I’, the Fourier series 
of f only has finitely many terms of degree < 0. 


§ 7. Fuchsian Groups 401 


f is said to be entire (resp. parabolic) if it is holomorphic on P and holomorphic 
(resp. parabolic) at parabolic fixed points of I’. Generalizing n° 18, (i) about the 
modular group, we are now going to show that automorphic functions (r = 0) on 
I can be identified with meromorphic functions on a Riemann surface associated 
to I. 


(iii) The topology of horocycles. The Riemann surface of I’. As already men- 
tioned, the group G acts on P = PUP,(R). A strange topology but one fitted to to 
the theory of automorphic functions can be defined on P”™ because, in this topol- 
ogy, automorphic forms on a group I” holomorphic at some parabolic fixed point € 
become continuous at €. To define it, we declare a set E C P to be open if EM P is 
open in P in the usual sense and if, for all € € E M P,(R), E contains a horocycle 
centered at €. The neighbourhoods of € € P'(R) are then the sets containing a 
horocycle centered at €. The elements of the group act on P as homeomorphisms 
as they map horocycles to horocycles. 

A sequence of zn € P converges in this topology to another point € = x(oo) if 
and only if 


lim Im [x (2e)] = +00. 


Indeed, for all T, we need to state that zn € 76.0(T), i.e. Im(x~'zn) > T, for large 
n. For this, it is enough that z, converges to € in the usual sense staying all the 
while in a parabolic cusp with vertex €, a notion which like above can be defined 
without referring to any group I’. 

Obviously, P , equipped with this topology, satisfies the Hausdorff axiom. P;(R) 
is discrete in this topology because the only point of Pi (IR) contained in a a horo- 
cycle centered at € is € itself. But P is not locally compact: the sequence zn =i+n 
does not contain any subsequence converging to a point of P, nor any subsequence 
converging to oo in the topology of P since this would imply that lim Im(zn) = +oo. 
Hence no horocycle satisfies BW. 

On the other hand, every parabolic cusp 


E:-w<a<a<b<+o0, y>T>0d0 


at oo is compact in P. Indeed for a sequence zn € EM P, either supyn < +00 
and it then contains a subsequence converging in P to a point F since x, stays 
in a compact subset of R, or supyn = +oo and it then contains a subsequence 
converging to the point oo € FE in the topology of horocycles. Thus F is compact. 
As the elements of G act on P as homeomorphisms, we conclude that every parabolic 
cusp ts compact in P. 


To obtain the Riemann surface associated to a discrete group I’, we replace P 
by the union P(r) of P and the countable set of parabolic fixed points of I”. Hence 
for [ = G(Z), P(r) = PU P,(Q). The group I acts on P(r) by homeomorphisms, 
which enables us to define quotient spaces 


73 T am unable to say who had the idea first, but its inventor must have clearly 
been familiar with baroque topologies, which, before the 1960s, excludes mod- 
ular function specialists dedicated to classical analysis. G. Shimura uses it in 
his excellent Introduction to the Arithmetic Theory of Automorphic Functions 
(Princeton UP, 1971), I presented it in my lectures in 1971-72, but Henri Cartan 
was already familiar with it in the 1950s. 


402 XII — The Garden of Modular Delights 


X(P)=P\P(r), X(r)=P\P(L). 


Theorem 24. Let I be a discrete subgroup of G and a,a’ two points of P(L). 


(a) IfW and W’ are two sufficiently small neighbourhoods of a and a’ in P(r), 
then 


(19.17) yW #W' = a=". 


(b) The quotient space X([') = I°\P(I’) is locally compact. 


To prove the first proposition, we need to distinguish three cases. 

(1) If a,a’ € P, there is a more general result available [Chap. XI, n° 15, (vi), 
theorem 24]. 

(2) If a € P and if a’ = x(oo) is a parabolic fixed point of I, replacing I" by 
xIx—' and a by x~ ‘a reduces the proof to the case where a’ = co and W’ = G(T). 
It then suffices to show that, for any compact set W C P and sufficiently large 
T>0,TWN6.(T) = @. This is lemma 1 of section (i). By the way, observe that 
the proof of this lemma also shows that 


(19.18) inf |c(y)| >0, 


VET oo 


where c(y) denotes the coefficient c of y, so that c(y) = 0 is equivalent to 7 € Isc. 

(3) Finally suppose that a = € and a’ = €’ are parabolic fixed points of I. 
This immediately reduces the proof to the case €’ = oo. Setting € = (oo) for some 
x € G, we need to show that if G = {Im(z) > T} and 6’ = {Im(z) > T"} are 
closed horocycles centered at oo and if T and T” are sufficiently large, the relation 
yxO #: 6’ requires yE = 00, ie. yx € B. 

If € = o(co) for some o € I’, then x = ob for some b € B, and as b transforms G 
into another horocycle centered at oo, yexG #G if and only if yoG #G for some 
other G. Hence the proof reduces to showing that 


YG #G SS VET xo 


if G is sufficiently small. This is point (b) of lemma 1 of section (i). 

If € does not belong to the same class of fixed points as oo, ie. if « ¢ B, the 
horocycles yaG = yG(€) are tangent discs to the real axis. If such a disc has non- 
trivial intersection with 6’, then this is also the case of its intersection with the limit 
horizontal Im(z) = T’. Multiplying y on the left by an element of "MU, yG(€) can 
even be assumed to have non-trivial intersection with some fixed compact subset 
C’ of this horizontal. But then the intersection of y~'C’ and G(€) is non-trivial, 
which, by lemma 1, is impossible if G(€) is sufficiently small. 

Let us now prove the second proposition of the theorem. As I’ is a group of 
homeomorphisms of P(I”), the canonical map 


nx: P(r) — X(L) 


is open. Proposition (a) of the theorem then shows that X (I°) is separated. On the 


other hand, the image in ax (I) of any z € P clearly has a compact neighbourhood, 
for example the image of the closed disc centered at z. If € is a parabolic fixed point, 
the image of any horocycle’* G(€) is a neighbourhood of 1(€). But Ue enables us to 
put all z € G(€) in the same parabolic cusp at €. As any parabolic cusp is compact 


in the topology of P, so is the image of G(€), qed. 
4 We set € € G(E). 
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Applied to a parabolic fixed point € = x(oo), proposition (a) of the theorem 
says that if G(€) is a sufficiently small horocycle centered at €, then 


VS(E) #S(E) — > VE Te. 


A function f defined on G(€) can be extended into a solution of 


f(yz) = F(z)" F(z) 


defined on ['G(€) if and only if, as in n° 18, this relation is satisfied by all z € G(€) 
and all y € I’:. Ue-invariance means there is (if f is holomorphic) an expansion 


J(x;z)"f(az) = x ane(nz/w). 


But this condition is sufficient for ensuring [¢-invariance only if € is a regular 

parabolic fixed point. If € is irregular, [~ contains the conjugate under x of matrix 

(5) of lemma 2. This gives the additional condition f[x(z + w/2)] = (—1)’f(«z), 

which, as was seen above is equivalent to a, = (—1)"~"ay. For r even (resp. odd), 

we, therefore, find an expansion depending on the even (resp. odd) powers of e(z/w). 
The different cases can be unified by setting 


(19.19) Ce(z) =e [2-3 (2) /we] 
where 
(19.19’) ue wW if € is regular, 


w/2 if € is irregular. 


For all r if € is regular, for all even r if € is irregular, any holomorphic form of weight 
r on a horocycle G6(€) centered at € is then a Laurent series in variable (19) in this 
horocycle. Conversely, any Laurent series in ¢, defined in G(€) can be extended into 
a solution of the functional equation on I.G(€). Using variable (19), we define the 
order ve(f) at the point € of a function of weight r (even or odd) for ’ meromorphic 
at €: it is an integer if r is even and of the form 4 +n if r is odd and € irregular. 


To equip x (I) with the structure of a complex manifold, we proceed as we did 
in n° 18, (i) for the modular group, so that we need not spell everything out in 
detail. 

To start with, this structure has to be defined on X (I) = I’\P. Hence we need 
local uniformizers at each point a = 7(a). For this we use open neighbourhoods W 
of a in P satisfying conditions (i), (ii) and (iii) of n° 18, (i). Since the conformal 
representation 


ze > (z—a)/(z—-@) 


transforms the actions of the stabilizer I, into a group of rotations of finite order 
n(a), holomorphic J,-invariant functions on W are, as in n° 18, power series in the 
variable 


Ca(z) = [(z -—a)/(z —a@]"™ 


and property (iv) of n° 18 continues to hold here. Denoting by qq the function which, 
in 7(W), corresponds to Ca, as in n° 18, (7(W), qa) is a local chart of X(I’) at a 
and, properties (i) to (iv) of n° 18 tell us that these local charts are holomorphically 
compatible. 
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We then need to extend this structure to X(L) = T\P(L) and so define local 
charts in the neighbourhood of 7(€) for € = x(0o). It was shown above that a 
holomorphic function f(z) defined in a neighbourhood W = 6(€) of € is I¢-invariant 
if and only if it is a power series in the [¢-invariant function ¢¢(z) which only 
depends on € up to a constant factor. Since it is clear that 


Ce(z) = Ce(2’) m(z) = m(z'), 


the local chart (7(W), ge) of X(r) at 7(€) is then obtained by transforming ¢¢ into 
a function ge on 1(W). 

Asin n° 18, (i), it follows that the charts of X (I’) thereby obtained are mutually 
holomorphically compatible. More precisely, if a@ and a’ are two distinct points 
of P(I’) and W and W’ are neighbourhoods of a and a’ satisfying the required 
conditions, then ¢, and ¢,/ are injective on W MW’ (hence holomorphic functions 
of each other). Indeed, if ¢a(z1) = Ca(z2) for distinct points in W m W’, then there 
would be some y € I, such that z2 = yz1, whence yW’ #W’ and so y € I. This 
is impossible for a 4 a’ since a parabolic matrix has a unique fixed point in P. 


(iv) Fuchsian groups. The Riemann surface of the modular group is compact 
because it is covered by the image of any sufficiently large horocycle. Because of 
this trivial remark, the following definition of Poincaré’s Fuchsian groups becomes 
natural: these are discrete groups I" satisfying conditions 


(GF 1) the parabolic fixed points of I’ are divided into finitely many classes 
T&, 

(GF 2) there exist a compact subset M C P and finitely many horocycles G(&;) 
such that 


P=|JoF where F=MuU|J6(&). 


The reader will probably observe that the first condition could have been omitted 
since it clearly follows from the second one. The compact set M is superfluous 
if I’ has parabolic fixed points: it suffices to choose sufficiently large horocycles. 
Besides, condition (GF 2) can be immediately restated for G: there exist a compact 
set M C G and a Siegel domain G such that G = I’F where F is the union of M 
and of finitely many translations 7,6 of G. 

Clearly, G(Z) satisfies these conditions, with n = 1 in this case. In the trivial case 
from this point of view of a group with a compact quotient, we choose F' = M. Any 
subgroup of finite index in a Fuchsian group I’ is again a Fuchsian group (obvious). 
In particular, all congruence subgroups of the modular group are Fuchsian groups. 

The set F' of condition (GF 2) is obviously not a proper fundamental domain 
for I’. To start with, each horocycle G(€;) is invariant under the stabilizer of the 
point €; in I’. If €; = o, this is a discrete group of horizontal translations, enabling 
us to put every z € G(oo) in a strip of finite width, i.e. in a parabolic cusp at oo. 
The same holds for the other €;. In other words, condition (GF 2) can be satisfied 
by replacing each G(&;) by a parabolic cusp at €;. Since a closed parabolic cusp is 
compact in the topology of horocycles, as expected, the Riemann surface x (L) is 
compact. 

Every parabolic cusp in P having a finite and invariant measure, so does F’. 
However, by assumption, the map from P onto I"\P is surjective when it is restricted 
to F’. So theorem 24 can be completed as follows: 


Theorem 25. The Riemann surface X(L) of every Fuchsian group I is compact. 
The total mass of the invariant measure on I'\G is finite. 
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The converse holds and is mostly interesting because of its proof. The easiest is 
to show that, if X(I’) is compact, then I" is a Fuchsian group. Indeed, we saw that 
the set of parabolic fixed points of I’ is discrete in P(I’). It is also closed. Its image 


in X(I’°) having the same properties, it is finite, giving (GF 1). Then for each € let 
us choose an open horocycle G(€) in such a way that G(yé) = yG(é). The image 


of P—UG(€) in EP) is clearly a set contained in X(JI’), closed in X(I°), hence 
compact, and so the image of a compact subset”? M of P. Then P—I'G(€) = I'M, 
whence (GF 2). 

Showing that the relation m(I\G) < +00 implies (GF 1) and (GF 2) is far 
from being as easy. The proof is mainly interesting because it requires construct- 
ing genuine fundamental domains bounded by finitely many circular arcs or rays 
orthogonal to the real axis. This amounts to non-Euclidean geometry open to con- 
siderable generalizations. But all this would take up too much space.”© 

For a Fuchsian group, these constructions show that, for r = 0, automorphic 
forms on I’ can be identified with meromorphic functions on the compact Riemann 
surface X (1°). Poincaré had already announced in his early works that, conversely, 
any compact Riemann surface X of genus g > 2 is isomorphic to X(I°) for some 
group with compact quotient acting without fixed points, hence not containing any 
hyperbolic matrices. The proof consists in considering the universal covering of 
X and in proving that any simply connected Riemann surface is isomorphic to 
the Riemann sphere or, if it is not compact, to some open subset of C, hence, 
by Riemann’s theorem on simply connected domains, either to C, or to P. The 
fundamental group of the compact surface X considered then becomes a discrete 
group I’ of analytic automorphisms of C or of P, so that either X = C/L, or X = 
I’\P where, in the former case, L is the period lattice in C (elliptic functions, g = 1) 
and, in the latter, a Fuchsian group with compact quotient acting without fixed 
points (automorphic functions, g > 2). In the latter case, the rational function field 
of X is just the field of automorphic functions on I(r = 0), which, for any Fuchsian 
group, is indeed an algebraic function field of one variable as in the theory of 
elliptic functions. Applying the result to the Riemann surface associated in Chap. X 
to an irreducible algebraic equation P(X, Y) = 0, we conclude that there are two 
automorphic functions f and g on I’ such that all solutions of P(z, w) = 0 are given 
by z = f(u),w = g(u) for some u € P unique modI’. This is the generalization 
of theorem 12 of n° 14, (ii) about elliptic functions, a result which could be called 
Poincaré’s youthful dream by analogy to Kronecker’s in another domain. 

Relations between the French and Germans, even in the case of mathemati- 
cians, was somewhat ambiguous between 1870 and 1914. It has needed at least a 
quarter century of cooperation-competition between Poincaré and some Germans 
(H.-A. Schwarz and especially Paul Koebe) to prove this difficult result using meth- 
ods from potential theory (existence of harmonic functions whose only singularity is 
a simple pole) at a time when this domain of analysis was still sufficiently primitive 
for Poincaré himself to make mistakes. In the early 1950s, another Franco-German 
undertaking made all this far more comprehensible by using algebraic topology and 
“coherent sheaves” of analytic functions. 


At a much more accessible level, using general theorems on compact Riemann 
surfaces, presented in Chap. X, would enable us to obtain immediately proposi- 


” Every point 7(a) of X(I°) has neighbourhoods of the form 7(W) where W is a 
compact neighbourhood of a in P. Hence the existence of M by BL. 

6 The construction is in Poincaré, but the first fully correct proof is probably the 
one found in C. L. Siegel, Topics in Complex Function Theory, vol. 2 (Wiley, 
1971). 
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tions (a) and (e) of the following result, which generalizes theorem 19 of n° 17. But 
it is possible to proceed differently. Proposition (e) and its proof can be general- 
ized to situations where neither Riemann surfaces nor holomorphic functions are 
encountered. 


Theorem 26. Let I’ be a Fuchsian group. 

(a) Every entire automorphic form of weight r is trivial (resp. constant) ifr <0 
(resp. r = 0). 

(c) An everywhere holomorphic solution f on P of 


fz) = J 2)" Ff 
is a parabolic form if and only if f- € L°(L\G), whenr > 0. When r > 2, only if 
(resp. if) 


(19.22) fr € L?(P\G) 


for all (resp. for some) p < +00. 
(d) The set of parabolic forms of weight r > 1 is identical to the space 


H2(F\G) = L-(P\G)NH,(G). 


(e) If f is parabolic, its Fourier coefficients at every parabolic fixed point of Doo 
satisfy an = O(n"/?). 
(f) The space of entire forms of weight r is finite-dimensional. 


(a) If r = 0, the function f(z) is holomorphic everywhere in X(I°), hence con- 
stant. If r < 0 and if oo is a parabolic fixed point, f(z) is bounded for y > T, 
hence also y"/?| f(z)| = |f-(g)| since r < 0. For an arbitrary parabolic fixed point 
€ = x(00), the function f,(xg) is bounded for y > T. So f;(g) is bounded on xG for 
any Siegel domain (9) in G. By (GF 2), I’\G is the union of finitely many images 
of sets of this type. As a result, f, is bounded on G, especially if '\G is compact, 
and so is in Hz°(G). Now, this space reduces to 0 if r < 0 (n° 16, theorem 14 for 
p= +eo). 

(c), (d) If f is parabolic and if r > 0, the invariant function y"/?| f(z)| is bounded 
on every horocycle of P(I’), hence on P by (GF 2). So parabolic forms belong to 
L*(I’\G), and thus necessarily to L? since m(I\\G) < +00. Conversely, if f, is 
bounded, f, is parabolic by relation (15) of section (ii). 

To make sure that f, € L?(I\G) for given p < +00, it suffices to check con- 
vergence in every parabolic cusp since I’\G is the union of a compact set and 
images of finitely many parabolic cusps. Propositions (c) and (d) then follow from 
equivalences (15’) and (15”). 

(e) Same proof for the modular group. 

(f) An entire form f is parabolic if and only if f(€) = 0 for every parabolic 
fixed point of I’, which forces a finite number of linear conditions on f. Hence it is 
enough to prove (e) for the subspace S,(I"\G) of parabolic forms, the latter being 
trivial if r < 0. Ifr > 1, by (15) and (15”) 


Sr(I\G) = Hr(L\G) nN L(P\G) = He(P\G) N L&(L\G). 


Moreover, S,(I\G) is closed in L? and L® since, in all L?(I’\G), convergence 
in mean for holomorphic functions implies compact convergence, and so preserves 
holomorphy. Now, m(I’\G) < +00. So all we need is to prove a general lemma: 


Lemma 3. Let X be a locally compact space, 4 a bounded positive measure on 
X and H a subspace of L?(X;p) MN L°(X;) closed in L?(X;p) and L*©(X; 1). 
Assume that all classes f € H contain a continuous function. Then dim(H) < +00. 
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We denote by H? (resp. H®) the space H equipped with the L? (resp. L®) 
norm. By the Fischer-Riesz theorem, the set of pairs (f, f) where f € H is closed 
in the Banach space H? x H®. Thus the two topologies introduced on H are the 
same (closed graph theorem), and so there is an upper bound 


IIflloo < MIlfll2 


valid for all f € H. If S is the support of ~ (complement of the union of open null 
subsets) and if every class f € H is identified with the unique continuous function 
on S which it contains, the left hand side is the uniform norm of f on S. So for all 
xe€S,|f(x)| < M||fll2. This gives an wz € H such that 


f(x) = (flwe) 


for all f € H. Moreover, by the previous inequality, ||wz||2 << M. Then let (e;) be 
an orthonormal basis for H. For any finite subset F' of the set of indices 7, 


S- lei(x)? = ¥- |(eilwe)|? < |lwell3 < M?. 
F 


F 


Thus, integrating, 37» |lei||3 < M?u(X). The left hand side equals Card(F), ged. 


Using the Riemann-Roch theorem on Riemann surfaces, the dimension of the 
space of entire or cusp forms of weight r can be explicitly calculated.”’ In partic- 
ular, for r = 2, parabolic forms correspond precisely to everywhere holomorphic 
differential forms on X(I°). So the space of parabolic forms of weight 2 has dimen- 
sion g. The space of entire forms of weight 2 can also be shown to have dimension 
g +N —1, where N is the number of classes of parabolic fixed points. This shows 
that ( as will be seen, contrary to what happens for r > 2), the values f(€) of an 
entire form of weight 2 cannot be chosen arbitrarily at parabolic fixed points. This 
is no great surprise. If f(z) = lang” where q = e(z/w) or e(z/w)® is the local 
uniformizer at infinity and if wy is the differential form corresponding to f(z)dz on 
x (1), then 


wy = >> ang" da/q 


up to a factor 27% or 471, so that wy has a simple pole at (co), where its residue is 
equal to f(oo). Hence >> f(€) = 0, which gives a non-trivial linear relation between 
the f(€). Saying that the dimension is exactly g+ N —1, therefore, means that there 
are meromorphic differential forms on x (I’) whose only singularities are simple 
poles at given points with given residues, provided their sum is zero. This is a 
general result about Riemann surfaces. In the case at hand, standard methods 
(Poincaré or Eisenstein series) do not lead to it because they always assume r > 2. 

Theorem 20 on zeros and poles of meromorphic modular forms can also be 
generalized to Fuchsian groups. We will see later (theorem 27) that they exist for 
all r, even if —1 € I. First of all, it is clear that automorphic forms of weight 0 
are precisely the meromorphic functions on X(I°). So, for any form f of weight 0, 
Yo va(f)/n(a) = 0, the sum being extended to I\P(L), with n(a) = 1 except if a 
is an elliptic fixed point of I’. The arguments used to obtain theorem 20 show that, 
for f of weight r, 


S~ va(f)/n(a) = er 


"7 See for example T. Miyake, Modular Forms (Springer, 1989), Chap. 2. 
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where the constant c is independent of r. We calculate it for r = 2, in which case 
f(z)dz = wy om for some meromorphic differential form wy on X(I”). The problem 


amounts to comparing the usual order va(f) of f at some point a € P(r) and the 
order va(wy) of wy at a = m(a). The arguments and the result are the same as in 
n° 18: 


va(f) = n(a)va(ws) +r(a)—1, or valf) = valve) +1, 


according to whether a is in P or a parabolic fixed point. Hence 


do va(f)/n(a) = S79 vawe) + D5 [1 - 1/n(a)] +N 


where N is the number of classes of parabolic fixed points and where the latter }> 


is extended to all classes of elliptic fixed points. Thus, if x (I) has genus g, then 
setting n(a) = +00 if a is a parabolic fixed point, 


2c=2g-24 Sot 1/n(a)] . 


So the problem reduces to the calculation of the genus of X(JI°). I will not discuss 
these questions at any greater length as they as they have little to do with group 
theory. 

Exercise 3. (a) Show that if f(z) is an automorphic form of weight r for the 
modular group, the function f[(z + 1)/2] is an automorphic form of weight r for 
the group I'(@) of the Jacobi function (see exercise 3 of n° 3). 

(b) Using the arguments of n° 5, show that 


m0(z)® = 48Ga(z) — 3Ga [(z + 1)/2] . 


Deduce a formula giving the number of representations of an integer as a sum of 
eight squares. 


20 — Parabolic Forms and Representations of G 


Theorem 26 shows in particular that, for a Fuchsian group, 
yp € L?(I\G) = ¢ parabolic if r > 1. 


However, for any discrete subgroup I’, the group G acts on L?(I\G) by right 
translations V(x) given by 


V(x)e(9) = v(gz)- 
This gives a unitary representation of G. The closed subspaces 
H>(L\G) = H,(G)N L?(P\G) 
are not invariant, in particular because the translations V(x) destroy the relation 


(20.1) plgk) = v(g)xr(k) or Vik)y = xrlk)p 


satisfied by all y € H2(I’\G), not to mention the holomorphy condition. But for all 
y € L?(L\G), in particular holomorphic, the elements V(x)y generated a closed 
invariant subspace H(y) in L?(I°\G). Similarly, As « € G and y € H2(L\G) vary, 
the elements V(x)p generate a closed invariant subspace D,. It is the closure of 
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the sum of H(y) for y € H2(I’\G). One of the results of the new theory is that 
representations of G obtained on H(y) are all isomorphic to the conjugate’ of the 
representation by left translations on the space #2(G) defined in n° 16. This at the 
very least supposes r > 1 for otherwise H2(G) would be trivial. 

Henceforth, as Harish-Chandra would have done, we will denote by 7, the 
representation of G on H2(G). So m,(x) is the restriction to H2(G) of the left 
translation operator L(x) on L°(G). For all r > 1, w, will denote the kernel function 
of H2(G), defined in n° 16, (iv). It is integrable for r > 2. Because of this property 
calculations are well-defined for r = 2, which they otherwise would not be. I” will 
only be assumed to be a Fuchsian group simply because this assumption is perfectly 
useless: the theorem applies to the group I’ = U(Z) for example and to I’ = {e}, a 
case not without significance. We will only need to know that H?(I°\G) is a closed 
subspace of Lr \G). This is clear since, as it concerns holomorphic functions, 
convergence in mean implies compact convergence. We assume r to be an integer 
because we are working in SL2(R). Similar results can be derived for a real r by 


using its universal covering G provided the centre of G is taken account of as in 
Chap. XI by proving the Bargmann orthogonality relations. 


Theorem 27. Let I’ be a discrete subgroup of G and x +— V (a) the representation 
of G on L?(I\G) by right translations. For any » € H2(I'\G), let H(y) be the 
smallest closed invariant subspace of L?(I\G) containing y. 

(i) (r € Z) The representation (H(y),V) of G is irreducible. 

(ii) (r > 1) It is equivalent to the conjugate of 7, and is square integrable and 


202) (Veaele) =f elunietian = [PEur(a) ifr 22. 


(iii) (r > 2) The semilinear map J given by 


NIF 


(20.3) If(w) = wr(e)? (V(x)glf) = wr(€) i= (yx) f(y)dy 


is an isometric isomorphism from H(y) onto H2(G) compatible with the represen- 
tations of G on these spaces. 

(iv) (r > 2) The orthogonal projection L?([\G) —> H2(I\G) is the map 
f to f *ur. 

(v) (r > 1) For any closed invariant subspace H of L?([’\G) such that the 
representation (H,V) is isomorphic to the conjugate of m,, there exists p € H2(G), 
unique up to a constant factor, such that H = H(¢). 

(vi) (r > 1) If (fi) is an orthogonal basis of H2(I\G), the subspaces H(fi) 
are pairwise orthogonal and their direct sum is the closed invariant subspace D, of 
L?(I'\G) generated by H2(I'\G). 


The proof found everywhere uses differential operators associated to the Lie 
algebra of G. It will be presented in n° 30. The one given here uses the simplest 
notions about unitary representations — abstract nonsense — and, instead of Lie 
algebras, integral methods of n° 16. As the spaces H?(G) can be generalized to all 
semisimple groups G for which G/K has a complex analytic structure as shown by 
Harish-Chandra in the 1950s, this is surely also the case of theorem 27. There are 


"8 The conjugate of a unitary representation (H,U) is obtained with replacing the 
map (A, f) > Af by (A, f) > A*f, the inner product (f|g) with (g|f), and 
by preserving the operators U(x). By setting the functions (U(x) f|g) to be the 
coefficients of (H,U), those of the conjugate representation are the functions 


(g|U()f). 
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also reasons to suppose that these results, once suitably reformulated, will apply 
to all locally compact groups by replacing the representation 7, with a square 
integrable irreducible unitary representation, as in the case, of course trivial, of 
compact groups. 

(a) Let us consider the representation of G by right translations V(x) on 
L?(I’\G) and show that, for any p € H7(I\G) and all f € L?(I'\G), the coef- 
ficient 


(20.4) 6; (2) = (V(a)ylf) = / vlya) Fedy 


r\G 


is in H,(G). If f is continuous with compact support mod I, there is [Chap. XI, 
n° 15, (iv)] a function p € L(G) such that 


f(x) = Sop). 


For given 2, the function y +> y(yx)p(y) is locally L? like y, with compact support 
like p, and so integrable over G. As a result (generalized LF), 


Le y(yx) f(y)dy = Pe dy d y(yyx)p(yy)dy = 


(20.5) = [ o(ya)p Ody = B+ ola); 


So, by the final remark of n° 16, the left hand side is in H,(G) like w. If now 
f € L°(L\G), there exist fr € L(P\G) such that lim || f — fnll2 = 0. As 


|5(x)| < Ilelle-Ilflle 


for all x and f, the functions (V(x)y|fn) converge uniformly to (V(x)y|f), whence 
the result. 
(b) As V(k)y = xr(k)y, the function &(x) = (V(x)y\y) satisfies 


(20.6) D(katk’) = xXr(k)O(x) xr(k’) « 


However, in H,(G), (6) determines © up to a constant factor: ® corresponds to a 
constant function on the unit disc D [n° 16, (ii)]. We then consider the operator 
A in H(y) commuting with all V(x) and set Ay = f. The function (V(x)y|Ay) 
is in H,(G) and a trivial calculation shows that it satisfies (6), and so is propor- 
tional to (V(x) ply). Since the V(ax)y generate H(y) topologically, Ay = Ay. Hence 
AV(x)y = AV(x)y and A = A1. So for all r, the representation is irreducible, even 
when H2(I’\G) = {0}... 

(6) also shows that  € H2(G) whenever H2(G) is non-trivial, i.e. for r > 1. 
The representation (H(y), V) is then square integrable since one of its coefficients is 
in L?(G), from which it follows (Bargmann) that the function (V (2) f|f’) is square 
integrable over G for all f, f’ € H(y). 

(c) If r > 1, simply by definition, the kernel function w, of the space H? (G) 
satisfies w, * Wy = Wr = Wy, Le. 


wn (@) = i wn yan ody = (we |L(a)ue) ; 


the inner product being on L?(G). As wy is in H,(G) and satisfies (6), the func- 
tions @ and w, are proportional, and the coefficient of proportionality is necessarily 
indicated in (2): put x = e. But if two irreducible unitary representations have a 
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common diagonal coefficient, they are equivalent as was seen in Chap. XI, n° 31, 
(iii) following the proof of lemma 1. The representation (H(y), V) is therefore equiv- 
alent to the conjugate of w,. Propositions (i), (ii) and (iii) of the theorem are now 
proved. 

(d) Proposition (iv) is, here too, a matter of trivial facts about representations 
admitting a generating vector. Since (V(x) y|y) = A(w,|L(a)w,), there is a semilin- 
ear isomorphism J from H(y) onto H2(G) commuting with the representations and, 
omitting a factor A2 without any significance, mapping y onto w,. By assumption, 


IV(2)=L(2)J, (JflJf’) = (ff) 
for f, f’ € H(y). For f € H(y), 
If (x) = (Jf|L(x)wr) , 


the main property of the kernel function [n° 16, (iv)]. Thus 


If (x) = JFIL(@) Je) = SAV @)e) = V@elf) » 


which is formula (3). 

Exercise 1 (r > 2). Show that on the everywhere dense subspace H;(G), the 
map J~' is given by J~'f = y* f up to a constant factor. (On the face of it, it 
is not obvious that this map extends continuously to H?(G). This follows from the 
existence of J). 

Exercise 2 (r > 1). Using the Bargmann orthogonality relations, show directly 
that there is a constant c such that 


(Ps lO) =e (FIP) 


for all f, f’ € H(y). The inner products are calculated on L?(G) and L?(I\G) 
respectively. 

(e) Before proving proposition (iv), we recall ne it is aa to associate 
to any bounded measure ps on G an operator V(y) = f V(y) in L?(I\G) 
[Chap. XI, n® 25, (i)]. We have ||V(#)|| < Ill, 


Viuxv) =V(u)V(v), Vin) =V(u)". 


All closed invariant subspace of L*(I"\G), in particular the subspaces H(y), are 
stable under the operators V(). For f, f’ € L?(I\G), 


(V@rsit’) = a (Ve) FF") dat@) = / due) I ful Wit = 


(20.7) ety f'(ydy | f(yx)du(x) = (f af’) , 


but we need to justify the permutation of integrals needs to be justified and show 
that f * pf € L*(L\G) in order to prove that this formal calculation is legitimate. 
Now, f and f’ being in L?(I°\G), the function 


LR> if | f( yx) f (y)| dy 
is bounded above by || f|2||f’||2. As ys is bounded, 


(20.8) fan ) f lttucys' (y)| dy < llallllfllallf'lz < +00. 
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So the integrals can be permuted by LF. 
Moreover as the latter integral in (7) is well-defined for all f’ € L(I'\G), the 
function 


/ flye)dule) = f+ Zily), 


is defined for almost all y by LF, and is locally integrable by definition [Chap. XI, 
n° 12, (ii)]. By (7) and (8), 


i lf’ (y)-f * H(y)| dy < |lell-fllellf’lle 
r\G 


for all f’ € L?(I’\G). As a result, f * @ belongs to L?(£’\G) [Chap. XI, n° 18, 
corollary of theorem 30] and 


(20.9) fEeL(I\G) = f *ii(z) =V(@) f(@) ae. 
(f) This result shows that y * i € H(y). If r > 2, one may choose du(x) = 
w,(x)dx since w, € L'(G). As 
Wr = Wp = Wr k Wr, 
the operator 
(20.10) ft f*w, =V Gr) f 


is an orthogonal projection. However, if f € L?(I’\G), lemma 2 of n° 16, (vi) shows 
that f «w, = f if and only if f € H?(I'\G). Hence for f € L?(I'\G), 


(20.11) few S]ViGf=f (r>2), 


proving proposition (iv). 

(g) Let J~' : H2(G) —+ L?(I\G) be a semilinear isomorphism compatible 
with the representations. We set g = J~'w, and let H(y) be the image of H?(G) 
under J~+, i.e. the closed invariant subspace of L?(I'\G) generated by the V(x)y. 
Proposition (v) of the theorem reduces to proving that y belongs to H?(I’\G) and 
is the only function f € H(y) with this property, up to a constant factor. 

Let J : H(y) —> H2(G) be the inverse map of J~+, whence w, = Jy. For all 
fEH(y), 


(V(x) elf) = (JfIV@) I~ wr) = (J f|L(a)wr) = If (a) 


by definition of the kernel function of H?(G). The function @(x) = (V(x) yl f) 
is, therefore, in H,(G) for all f € H(y), and even for all f € L?(I’\G): replace 
f with its projection on H(y). Like in point (a) of the proof, one may choose 
f(y) = Up(vy) where p € L(G). By (5), 


}; = px pe L?(I\G)NH-(G) = H2(L\G). 
Choosing the pn so that the measures pn(x)dx form a Dirac sequence at e, like in 
any unitary representation of a locally compact group, [Chap. XI, n° 25, (v)], 
y =lLim.?V (in) y =Li.m.? yp * fn, 


where this is the limit in L?(I’\G). However all y*jin belong to the closed subspace 
H2(L£\G) of L?(I\G) for all IP’ (and finite-dimensional if I is a Fuchsian group). 
So the same holds for y. 
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All that remains to be shown is that every y’ € H(y) N H2(I\G) is propor- 
tional to y. As the representation V on H(y), equivalent to the conjugate of 7,, 
is irreducible, H(y’) = H(y). Hence, by proposition (ii) of the theorem, there is 
an isomorphism J’ : H(y’) —> H2(G) mapping y’ onto w,. J and J’ differ by an 
automorphism of the representation on H2(G), so by a scalar since it is irreducible. 
This gives proposition (v) of the theorem. 

Alternative version: V(k)y’ = yr(k)y’ since vy’ € H2(I'\G). But the subspace 
of f € H2(G) such that L(k)f = y,(k)f has dimension one (constant functions on 
the unit disc) and contains w,, qed. It is even possible to refer to the general result 
that will be proved in n° 30: in every irreducible unitary representation (H,U) of G, 
the subspace of solutions of U(k)a = x(k)a has dimension < 1 for all characters x 
of Kk. 

Exercise 3. For r > 2, show directly that J(f *w,) = Jf for all f € H(y). 

(h) To finish the proof, consider two functions vy, € H2(I’\G) and the pro- 
jection P on H(y) in L?('\G). It commutes with all V(«), and so maps H(w) 
onto a subspace in which the linear combinations of PV (x) = V(x) Pw are every- 
where dense. The subspaces H(y) and H(q) are, therefore, orthogonal if and only 
if Py = 0. This condition remains to be shown to be equivalent to (y|q) = 0 and, 
to this end, that Pw is proportional to y. But as P commutes with all V(k), 


V(k)Py = PV(k)p = xr(k) Py 


and we remarked above that the only elements of H(y) satisfying this relation are 
the multiples of y. 

Finally, let (y;) be an orthonormal basis of the space H?(I’\G). The spaces 
H(i) are pairwise orthogonal and contained in the closed invariant subspace 
D, generated by H2(I’\G). Now, the finite sums of functions V(x)y, where 
y € H2(L\G), are everywhere dense in D,. As every y € H2(I'\G) is the limit 
of finite linear combinations of the functions yi, these V(x)y are in the Hilbert 


sum of the spaces H(y:i). Thus D,; = OH (vi). This is a Hilbert direct sum for all 
I’, and a finite direct sum if I’ is Fuchsian. 
Exercise 4. Interpret the theorem in the case I’ = {e}. 


21 — Poincaré, Eisenstein and Maaf-Selberg Series 


The reader will probably have noticed that, contrary to the statement of theo- 
rem 19, that of theorem 26 does not include any point (b). It could be, but is 
not, the author’s “malfunction”, as is politely said nowadays. In this n°, we will 
present methods that, together with very simple series, enable us, like in the case 
of the modular group, to effectively construct automorphic forms and in particular 
analogues for Fuchsian groups for Eisenstein series, following which the omitted 
proposition (b) will become as obvious as for the modular group. 


(i) Poincaré series’. In 1880, wishing to extend the theory to discrete groups 
to which conventional calculations cannot be generalized, Poincaré found a less 


79 Most of the arguments and calculations of this n° can be generalized to the 
much more general case of the Siegel modular group; see my talks in the Cartan 
Seminar on automorphic functions of several variables (1957-1958). They have 
been reproduced for SL2 in Miyake, Modular Forms, chap. 6, probably because 
through Ichiro Satake’s participation in the Cartan Seminar a copy of it reached 
Japan! See also Walter L. Baily, Jr., Introductory Lectures on Automorphic Forms 
(Princeton UP, 1973), interesting in more than one respect. 
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miraculous way than arithmetic for constructing entire (and even, as will be seen, 
parabolic) automorphic forms. An arbitrary discrete subgroup I" of G being given, 
starting from a holomorphic function f(z) on P or even only on an open J’-invariant 
subset 2 C P, it consists in considering the series 


(21.1) Prg(z) = 2 I2) Faz) = DEF) 


hoping they converge. If that is the case, (15.10) shows that the function of weight 
r associated to P,,¢ on the inverse image of 2 under g+—> z = gi is 


(21.1) Peggy > Fea): 


which, hundred years later, makes Poincaré’s idea trivial. If the function f, is in- 
tegrable over 7~'(), the series (1) or(1’) converges in mean — which, if f is holo- 
morphic, most probably implies compact convergence — and its sum is integrable 
mod I’. If I’ is a Fuchsian group and if 2 = P, it will therefore be a parabolic form 
by theorem 26, (c). These heuristic considerations remain to be justified. We do this 
by comparing series (1) to an integral. This method appears somewhat implicitly 
(see below) in Poincaré and was systematically exploited much later. Both Cartan 
Seminars of 1953/54 and 1957/58 on automorphic functions use it extensively, as 
well as Siegel.®° 

We again start from the fact that, for any compact set A C 92 and any compact 
neighbourhood A’ C 2 of A, there is a constant MM, such that the uniform norm 
on A of any holomorphic function f on 2 has an upper bound 


|flla < Mm ff |f(z)| dady. 


To formulate this result in terms of f,, we use (15.30) 


[ltola= [ 


As the variable y stays away from 0 and +oo in A’, in the previous inequality, the 
Euclidean measure dady can be replaced by the invariant measure dm(z), so that 
it can equally well be written 


y’?? f(2)| am(z). 


(21.2) IFrlle < Ma f) Ufola)l dg 


where C and C” are the inverse images of A and A’ under g +> gi. For given A 
and A’, the constant M2 being the same for all holomorphic functions on P, (2) 
may be applied to the left translations L(+) f, of f-. Thus 


TM flle <M ff onda = ae f  lselales. 


However, yC’ #7'C" implies y~'y' € C’C’"' NL. The latter, being a compact 
discrete set, is finite with N elements. Hence each yC’ has non-trivial intersection 


8° Carl Ludwig Siegel, Finfiihrung in die Theorie der Modulfunktionen n-ten Grades 
(ie. related to the group Sp(n,R)), Math. Annalen, 116, 1939, pp. 617-657. 
There are probably older references by the same author, in whose complete works 
can be found considerable ideas and beautiful calculations despite the hostility 
Siegel showed in his old days for new methods (representations and adeles). 
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with at most N sets y'C’, so that the sum of the integrals extended to all yC’ is 
equal to at most N times the integral extended to the union I.C’ of these yC’. 
This is an easy exercise of measure theory. Series (1) therefore converges normally 
in every compact subset provided 


(21.3) fv? t@lam(2) < +00 


for all compact sets A C 92. This condition is also necessary since 
J lela) dg < m(C) sup [F0)| 
yC LEC 


The problem is then to show that there are functions f satisfying condition (3). 
To evaluate the obstacle, let us suppose that 2 = P and that the quotient I'\P is 
compact. A can then be chosen so that [A = P, in which case the condition means 
that f € H;(P). The latter is a non-trivial space if and only if r > 2. In the case 
of an arbitrary group I’, condition f € Hz(P) is obviously sufficient. Moreover, as 
this assumption means that the function f, is integrable over G, the sum of series 
(1’) is integrable over ['\G. As a result: 


Theorem 28. For all discrete subgroups I C G, all open T-invariant subsets 
QC P, allr > 2 and all holomorphic functions such that 


[ oP t@lacay < +00, 
Q 
the Poincaré series P,,¢(z) = )> J(y32z) "f(yz) converges normally in every com- 


pact subset of Q and its sum belongs to H,;(I\Q). 


In fact, initially Poincaré only considered series P,,¢(z) for functions f which, on 
the unit disc D, correspond to polynomials in z. His arguments are a bit different. 
The conformal representation 


zreoC=(z-i)/(z+i)=s8 'z 


used in n° 15, (vi) transforms I’ into a subgroup I’ = s~'I’s of the group G’ of 
automorphisms (15.34) of D. Associating the function 


9(¢) = J(si0)" f(s¢) 


to f, series (1) becomes 


(21.4) ST (V9 “vr 


where summation is over I’. This does not appear to simplify the problem. But 
instead of comparing the terms of the new series to integrals with respect to the 
invariant measure of D, Poincaré uses the Euclidean measure d€dn = du(¢) on 
the complex plane. For any sufficiently small disc A centered at ¢, the area p(y’ A) 
is approximately equal to the product of (A) and the Jacobian of y' calculated 
somewhere in A, namely*! 


ld (y'¢) fag? = [FQ |" - 


8! The Jacobian of a holomorphic map z +—> f(z) is |f’(z)|? because of Cauchy’s 
holomorphy condition: Chap. II, formula (24.9). 
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As p(D) < +00, without strictly speaking using integrals, he concludes that 
S* u(y’ A) < +00 and as a result that 


SF (sO) < +00. 


So it suffices to assume that vy is bounded on D, for example is polynomial, in order 
to make series (4) convergent for r = 4. And if it converges for r = 4, it necessarily 
converges for r > 4. 

This argument, based on 


S-|d (76) /ae|? < +00, 


is simpler that the previous one since it only uses spaces H|; but it does not include 
the case r = 3, which occurs for groups that do not contain —1 (as well as non- 
integral r € ]2,4[), and the assumption that ~ is bounded is too restrictive. In fact 
it is enough for ¢ to belong to the space H,(D). It can be immediately generalized 
to groups acting on a bounded domain of a space C”. 

Characterizing polynomials to which Poincaré confined himself is easy. On D, 
these are precisely the holomorphic functions whose images under rotations centered 
at 0 generate a finite-dimensional vector space. Returning to the half-plane and to 
G, we get functions f, whose left translations g +> f,(kg) under k € K generate 
a finite-dimensional space. They will be called K-finite functions*®” on G. Formula 
(16.15), with n = r + 2p, p > 0, provides a basis for it for all r. 

Exercise 1. What is relation (5) in P? 

Let us consider the simplest case of the function f(z) = (z — w)~? where p > 0 
and choose 22 = P—I"w. Any compact subset of 92 is contained in P—I'V where V is 
a compact neighbourhood of w in P. As 2. C P—V, to show that the corresponding 
Poincaré series converges normally in every compact subset of (2, it suffices to check 
that y’/?|f(z)| is integrable in P— IV, or even more so in P — V. In this open set, 
the ratio |(z — w)/(z — @)| stays in a compact subset of R; since 


ze>(z—w)/(z-W) 


transforms P — V into D — V’, where V’ is a neighbourhood of 0. Hence it is 
enough to check that the function yl? (z —@)” is integrable in P — V. Now, we 
know [Chap. VIII, n° 12, (iv)] that for p > r/2 > 1, it is in the space H}(P), which 
is better. The corresponding Poincaré series P,,f therefore converges normally in 
every compact subset of P— Iw ifp>r/2> 1. 

If w is not a fixed point of I’, the only terms of the series having a pole at w 
correspond to y = +1. If —1 ¢ I’, w and its images are poles of order p of P,.,¢; 
if —1 € I, the result is the same for r even. To deduce that P,, is a (non-entire) 
automorphic form of weight r, its behaviour at parabolic fixed points of I” needs to 
be analyzed. Since f is integrable in P — IV, the series P,,¢ is integrable mod I’ in 
this open set. However, thanks to lemma 1 of the previous n°, for every parabolic 
fixed point € of I, P —I’V contains a horocycle centered at €. So the function P,., + 
is not only holomorphic, but also parabolic at each of these points. 


82 Their importance is not only due to the fact that they are known explicitly. 
K-finite functions can be defined for all groups G and all compact subgroups 
of G. Their role is essential in Harish-Chandra’s theory (G semisimple, K maxi- 
mal compact) and in automorphic function theory 4 la Maa8-Selberg-Langlands, 
because the Lie algebra of G acts almost algebraically on K-finite vectors of an 
irreducible representation. See §8 of this chapter. 
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Theorem 29. Let I be a Fuchsian group. For all r € Z, even if —1 € I’, there 
exist non-trivial automorphic forms of weight r for I’. 


the previous argument proves the theorem for all r > 2: choose f(z) = (z—w)~? 
with p > r. If w’ ¢ Iw is not an elliptic fixed point, the function f’(z) = (z—w’)~”” 
is also appropriate for all r’ > 2 and all p’ > r’. The quotient P,, ¢(z)/P,,¢/(z) is 
then a non-trivial automorphic form of weight r —r’ having poles of order > p at 
the points yw and zeros of order > p’ at the points yw’, qed. 

These results can be refined; see for example Miyake, § 2.6. I will not go any 
further in this direction and henceforth will keep to entire automorphic forms. 


For f € H;(P), though it is already known, using results of n° 16, (i) the series 
P,,¢ can be checked to indeed be parabolic forms for all Fuchsian subgroups I’. 

More generally, let us consider a group I’ admitting oo as a parabolic fixed 
point and set 


Us =INU 


to be the subgroup of matrices (nw) for some w > 0. Then any holomorphic 
solution of f(yz) = J(y;z)’ f(z) has an expansion 


(21.5) f(Z= So ane(z/w)”, an = wi $ FG) e(z/w) "dx 


where integration is modulo w. We need to show directly that if f is a Poincaré 
series, then an = 0 for all n < 0. 

The easiest is to argue in G. We have to expand the function ut P,,¢(ug) = 
> fr(yug), as a U.o-invariant Fourier series. Identifying the matrix x(u) € U with 
the number u € R, the expected Fourier series can be written 


Py, 5 (ug) = S 5 e(u/w)"a an( = dal ug) 


where 
ney Sar I up Muley Mu fer). 


Grouping together the terms yy’ where y’ € Us. and observing that e(u/w) is 
U.o-invariant, we get 


wan(g) =f duS~ Soe w])~" fr (yy'ug) = 
AU 


Uc y! CU co 


fr (yy'ug) , 


lI 
M 
=—— 
= 
ns 
= 
£, 

3 


whence 
wan(g oy [e (u/w)~” fr(yug)du 


This formal calculation, which does not assume f to be holomorphic, is justified 
because exponential factors have absolute value 1. Like the Poincaré series, for all 
g, the series being integrate thus converges normally over any compact subset of 
Ug. For y =e, 
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J Felughelu/isy "du = fI(g:i) Fle + welu/o)"du = 
U R 
= J(g;1) "e(z/w)" f(n/w) 
as was seen in n° 16, (i). There are similar formulas for the translations g +> f,(yg) 
of f, under 7 € I as they are also in H}(G). Hence, since f(t) = 0 for t < 0 for all 
f ©H;(P), an(g) = 0 for n < 0, qed. 
This computation method does not provide any explicit formulas for the coeffi- 


cients dn, since the effect of left translations by y € I’ on f is not simple [n° 16, (v), 
exercise 3]. To get the coefficients a,, summing over the cosets Uso is necessary 
as will be seen later. 


We can at last address the question whether, for a Fuchsian group I’, a parabolic 


form of weight r > 2 is a Poincaré series. Let f € Hi(G) and y € H2(L'\G) bea 
parabolic form. Let us calculate the inner product 


@IP4)= a 4 POPrahdg = Ts to oO ROD = 


= ih y(g)fr(g)dg - 
G 


This formal calculation supposes y(g) f(g) to be integrable over G, which is the case 
since y is bounded on G (theorem 26) and f; is integrable. Hence if y is orthogonal 
to all Poincaré series, then it is also orthogonal to H;(P). However, every y € H+(G) 
orthogonal to H;(G) is trivial because the corresponding holomorphic function yp 
on the unit disc is orthogonal to monomials ¢”. By the way, this shows that it is 
enough to consider the series P,,¢ of K-finite functions. 

Hence, the subspace orthogonal to Poincaré series in H?(I’\G) is trivial. If Fis a 
Fuchsian group, H2(I\G) is finite-dimensional, whence f = 0. Every f € H?(I'\G) 
is therefore a Poincaré series. In the next section, we will be able to give more 
details about this result due to H. Petersson. 


The previous calculations lead to the construction of a kernel function for the 
space H2(I"\G) when I is a Fuchsian group. 

Exercise 1. Suppose r > 2. Let w, be the kernel function of H2(G). It is inte- 
grable. (a) Set 

wr (ty) = So wr (ayy) . 
Show that, for all x, the function y > w/ (x,y) is in H;(I\G). (b) Show that 
(*) oe) = | Fe Helw)dy 
r\G 
for all y € H2(I°\G). In particular, 
[oh @2eF u,2)de = 0 (0,9). 
r\G 


(c) Using the fact that, for all y € H2(I'\G) = H&(I\G), the second integral (*) 
is a bounded function of x, show that 


sup | 
cel JI\G 


r 2 
wr (z,y)| dy < +00 
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and that w} (x,y) is bounded on G x G. (d) Set 
Exot) = ful (w,2)e(e)de 
r\G 


for all y € L?(I’\G). Show that E, is the orthogonal projection of L?(I\G) on 
H2(I\G). Deduce that 


dim H2(I°\G) = i ap x ya) 
r\G 


The rest of the calculation consists in grouping together the terms of the series 
according to the conjugacy classes in I’, in calculating the centralizers of the y 
in G (if y 4 +1, these are well determined conjugates of the subgroups K, A 
or U according to whether 7 is elliptic, hyperbolic or parabolic), in transforming 
the integral over I’\G of the subseries extended to a class into an integral modulo 
the centralizer of y, in checking that everything converges, etc., in other words of 
techniques developed by Selberg for his “trace formula”. 


(ii) Poincaré-Eisenstein series. Supposing that I’ admits oo as a parabolic 
fixed point, which implies Ux. = U(wZ) for some w > 0, we are going to show 
that the Poincaré series can be written in terms of standard series involving 
no “arbitrary” function f. For [ = U(Z), this result will reduce to formula 


YS fet+n)= ye f(n)e(nz) of n° 16, (i) and for SL2(Z), to calculations of n° 17, 
(iii). 
Grouping the terms of the series into cosets yUoo, 


Ss J(4;2) LF z) + nw] = 


Uoo\LP 
(21.6) = wt SO I(2)7" YS fln/wye ly(z)/w]” 


Thus, calculating formally, 
(21.7) wPrs(2) = >— f(n/w)Ern(2) 


with Poincaré-Eisenstein series 


(21.8) a J(y32) "e(qyz)” (r > 2,n > 0) 
Us\P 
which should be written E;.n,.. to indicate that they are with respect to the fixed 


point co. To get functions that are not trivially zero, we need to assume that r is 
even if —1 € I. If 00 is irregular, ’ contains the matrix 


= (9 “) 


and as Toxoy = Uscy U Ucoyoy, the terms of series (8) in y and yoy can be grouped 
together and it can then be summed mod I’. As J(yo7; z) = —J(y;2z) and 


1 


e[y07(z)/w] = e[y(z)/w + 5] = —elr)/4] , 


this gives a factor 1 + (—1)"*” in each term of the sum mod Iss. As a result, 
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(21.9) Eyn(z) =0_ if co is irregular and n+r odd. 


To justify the permutation of the 5> in (6) and, at the same time, to ensure 
compact convergence of series (8), it suffices to prove that series (6) converges 
unconditionally. For z in a fixed compact set, there is (lemma 1) an upper bound 
Im(yz) < T for all y and so 


As a result, 
DY |Ferfee f(z) /u] 
Like S> F(n)e(nz), the second series converges for all f € H;(P) [n° 16, (i)]. Then 


YS 2 Lo Fe/myeln 2)/w)" 


Uoo\P Uoo\P 


‘i < >>| f(n/w) exp( 2nnT)| =M<4+0. 


=a 


(ial. 


so that to justify going from (6) to (7) and compact convergence of series (8) for 
n > 0, it suffices to show that 


(21.10) Sos" < +00, 


Uso\P 


which brings us back to the convergence of the Eisenstein series 


(21.11) Bete Se) See) 


Use\P 


associated to the parabolic fixed point oo. 

If this point, which will be proved in section (iii) in the general case, is admitted 
for now, relation (7) and theorem 26, (f) show that the series P,.¢ are limits of linear 
combinations of series (8). But in finite dimension, every vector subspace is closed. 
As shown at the end of section (i), every f € H?(I°\G) is a Poincaré series. Thus 
the results of these two sections can be summarized as follows: 


Theorem 30 (H. Petersson). Let I be a Fuchsian group. 

(i) For all r > 2, the space H2(I\G) of parabolic forms of weight r for I’ is the 
set of Poincaré series P,,¢ associated to K-finite functions f € H;(P). 

(ii) Every parabolic form of weight r > 2 is a linear combination of Poincaré- 
Eisenstein series. 


If I’ contains the matrix —1, r is even and the general term of (11) only depends 
on the coset of y modulo the subgroup 


of matrices for which c = 0, d = +1. So the sum is twice that obtained by summing 
mod I... To avoid this factor 2, in the case of SL2(Z), a factor 4 was introduced in 
(17.20) in definition (11). We will not do so here in order to get formulas for all I’. 
If the fixed point oo is irregular, the formal argument leading to (9) applies equally 
for n = 0. Thus the series E;,.. are trivial for r odd. 


Exercise 2. (a) Show that 


Sl wr(zt+n) = So n™ tel n(z +%)| 


n>0O 
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up to a constant factor. (b) Show that 


door (y9) = D5 exp(—207/w)"n™™ Ex.n(9) 


n>0 


up to a constant factor. (c) Using exercise 1, recover theorem 30. 


Another reason for the importance of the series E,, studied by Petersson but 
also found in Poincaré*? for the modular group, follows from the next calculation. 
Let f be a holomorphic solution of the functional equation f(yz) = J(y;z)" f(z), 
without any other restrictions for the moment, and let us compute the integral 


(2112) (flEnn), = is VLE @am(e) =f fol Fvn eg, 
where 
Ern(g) = J(93t)" Epn(z = Soe, (yg; n/w) 


and 


er(g;t) = J(g;t) “e(tz) 


are the functions of weight r associated to E,.»(z) and e(tz) on G. Let us suppose 
that the function under the f sign in (12) is integrable. As f-(yg) = fr(g), 


(flErn) =/.. dg S~ fr(yg)er(yg;n/w) . 


User 
Thus 
(F[Enm), =f felg)er(ginjohdg = 
Uoo\G 
(21.13) = y” | f(z)e(nz/w)dad*y. 
Uco\P 


Using the Fourier series f(z) = }> ape(pz/w) and integrating term by term, we get 
(21.14) (f|Ern)» Pee e(pz/w)e(nz/w)dm(z). 


Replacing z by wz, leaves the measure invariant, replaces Us. by U(Z) and multiplies 
y’ by w”. Integration is then via an integral over x mod Z, which is zero for n # p. 
For n = p, the product of the two exponentials equals exp(—47ny), thus the final 
result follows readily: 


(21.15) (f|Ern), = (4an)'"" P(r — lw" an. 


83 See Fonctions modulaires et fonctions fuchsiennes (CEuvres completes, end of vol- 
ume II), a 1912 article sent to Ann. of the Fac. of Sc. of Toulouse on the eve of his 
admission to a clinic for a prostrate operation which went wrong... Formula (12) 
is already in there for particular functions f (those corresponding to monomials 
¢? on the unit disc), and so are the Fourier series of the functions En (n > 0) 
on the modular group and the arithmetic sums occurring in their coefficients. 
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These calculations still remain to be justified assuming f to be parabolic. To get 
(13), it suffices to show that the integral obtained extended to the strip | Re(z)| < 
w/2 of P is convergent. Now, the function y"/*f(z) is bounded on P if I is a 
Fuchsian group, the exponential is rapidly decreasing at infinity since n > 0 and 
the function y"~' is integrable with respect to d*y in the neighbourhood of 0 since 
r > 1. So it clearly converges. 

To justify going from (14) to (15), it suffices to show that 


S~ lap I pi lelmz/elnz/i)|dm(z) < +00. 


The integral is proportional to (n + p)*~" = p'—", whence the condition 


>» |ap|/p"* < +00. 


As we know that ap = O(p"/ 2) for any parabolic form, assumption r > 4 justifies 
the calculation, but leaves open the cases r = 3 and r = 4. The question is academic 
for the modular group. If I’ is a congruence group, Deligne’s upper bound 


ap =O (peo) 


can be used. This result, which does not come very cheap, takes care of the question 
for r = 4 but not for r = 3. To make the series convergent in this case, we need an 
estimation of the genus in O(p'~‘), but these gentlemen say that it is not possible 
to go beyond O(p'**). 

In conclusion, The Fourier coefficient with index n > 0 of any parabolic form f 
of weight r is the inner product of f and the Poincaré-Eisenstein series Ey», up to 
a simple factor, at least for r > 4. 


(iii) Eisenstein series. They do not generalize the series G,(z) of the modular 
group, which are not well-defined for non-arithmetic groups I’, but the reduced 
series 


(21.16) Ex,oo(z)= >) F(z) 


Uso\l 


introduced in (17.20) with a factor 3. They are defined for any discrete group I 
having oo as a parabolic fixed point. For r odd, as was seen above, the series is 
trivial if —1 € I or if the fixed point oo is irregular. 

The convergence of these series, obvious if 7 C SL2(Z), requires a more difficult 
proof for other groups, but its principle can be applied to multiple Eisenstein series 
associated to arithmetic subgroups of general semisimple groups. In fact we will 


study the Maaf-Selberg series®4 


54 Hans Maa8, Uber eine neue Art von nichtanalytischen automorphen Funktionen 
und die Bestimmung Dirichletscher Rethen durch Funktionalgleichungen (Math. 
Annalen, 121, 1949, pp. 141-183), Atle Selberg, Harmonic analysis and discon- 
tinuous groups in weakly symmetric Riemannian spaces (J. Indian Math. Soc., 
20, 1956). There is also, for all r € Z, a series 


Mr,0o(2, 8) = Se J(y32z)" Im(yz)°-"/? ; 


These series, which satisfy differential equations which we shall prove in § 8, en- 
able us to write down explicitly the “continuous spectrum” of the representation 
of G on L?(I°\G), hence their importance. 
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(21.17) Moo(2;8) = So Im(qz)* = SD alyg)* 

Deg \F Doo \P 
where s is a complex exponent. As |.J(y; z)|7? = Im(yz)/Im(z), convergence will be 
obtained for (16) when r > 2 by showing that series (17) converges for Re(s) > 1. 
Summation mod/’. aims to ensure that the “constant” term, i.e. independent 
of x, of the Fourier series 


Moo(z; 8) =} an(y)e(na/w) 


of the function is, as will be seen, of the form ao(y) = y* + a(s)y'~* without any 
superfluous factor 2 if Uso # Ix. However, summing mod J’~ is problematic for 
the series 


(21.17’) My,oo(25 8) = > J(y32)7* Im(y2z)° 
Too\0 


when Ix, 4 Us. One obviously needs to assume r is even if —1 € I or else if the 
fixed point oo is irregular. In the latter case, summing mod U. is certainly possible 
for r odd, but the result would be zero. 

We will set 


a(g) = |J(9;4)|* = Im(gi) 


as in n° 15, whose notation we will keep. For k € K, u € U, h = h(t) € A and 
z= gt, 


uhgi = uhz =? z+u=a(h)z+u, 
whence 
a(uhgk) = a(h)a(g). 


The Haar measure of G is given by 


y(g)dg = / i y(bk)dbdk = / i i y(huk)dhdudk = 
= /// y(uhk)a(h)~ dudhdk . 


g +> gi transforms the horocycles centered at oo into Siegel domains 
6(T):g=uhk with a(h) > T 
of (19.9) and the horocycles centered at € = xoo into their translations r6(T). 


Theorem 31. For every discrete subgroup I of G having a parabolic fixed point 
at infinity, the series 


(21.18) Mx(9;8) = S> a(yg)° = >_ Im(y2)’ 


Paghk 


converges absolutely for Re(s) > 1. For any parabolic fixed point € = xoo of I’, the 
series converges normally in every horocycle G(€), unless € = coo (o € I’) belongs 
to the same class as the point oo. In this case the series with the term a(a—‘g)* 
removed converges normally in G(&). 
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Again, the proof amounts to comparing the series with an integral. 
Lemma 1. Normal convergence in G(E€) is equivalent to convergence at x € G. 


To prove this it suffices to find an upper bound 
a(yrg) < Ma(yx) 


valid for y ¢ Io and g € G(T). 
Let us first show that there is a constant M such that 


(21.19) a(yg) < Ma(y) 


for all y € G— B and all g € G(T). The ratio between both sides being invariant 
under y > uhy, one may assume that y € K, a compact set in which a(y) stays 
away from 0 and +00. So it suffices to prove that a(yg) is bounded for y € K-KNB 
and g € G(T). Then (Bruhat) y = huwu’ where, h, u and wu’, like y, stay in compact 
sets. In a(yg) = a(h)a(wu’g), u’g stays in UG(T) = G(T), which gets rid of u’. 
On the other hand, the factor a(h) is bounded since y € K. Hence the function 


gt a(wg) = Im(wgi) = Im(—1/z) where z = gi 


remains to be shown to be bounded on G(T’), but this is obvious. 
This being settled, let us return to series (18) in G(T). By (19), 


(21.20) a(yrg) < Ma(yx) for yz¢B, g€ G(T). 


If the fixed point € belongs to the same class as oo, i.e. if x = ob for some o € I" 
and some b € B, clearly, 


yE BSS 0 ET x - 


Thus (20) applies to all terms of the series except a(a~'g). Therefore, normal con- 
vergence in G(T) of the series with this term removed follows from its convergence 
at x (or any other point). If, on the other hand, € is not equivalent to the fixed 
point oo, since yx € B, y€ = ov, which is impossible. In this case, (20) applies to 
all y, qed. 


Lemma 2. Let y be a continuous function with values > 0 on G such that 
(21.21) y(uhg) =a(h)*y(g) forsomesER. 


For all compact sets C C G with measure > 0, there exist constants m, M > 0 such 
that 


mip(x) < [ eCeaiaa < My(z) 


for allz eG. 


Let y’(x) denote the integral. The function y’ is continuous, satisfies (21) like 
y and is everywhere > 0 since m(C) > 0. The ratio y(x)/y'(x) > 0 is invariant 
under « —> wha, and so takes the same values in G as in K. Since K is compact, 
in G, it stays away from 0 and +o, qed. 

As the MaaB series is of the form }> y(yg) where ¢ satisfies (21), we might as 
well consider the general case. For all x € G and all CC G, 


etre) ff etrea)da = f 


x 


7 p(g)dg = [., plg)dg 
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where C’ = xC, which reduces to the proof to the convergence of the series 


As C is only required to be compact and have measure > 0 and as I is discrete, we 
may suppose that C is sufficiently small so that, for given x, the yC’ are pairwise 
disjoint. Convergence of the previous series then amounts to saying that y(g) is 
integrable over ,\T°C". 

But the map g —> gi = z from G to P transforms C’ into a compact subset W 
of P and 'C’ into a set 'W contained in the strip Im(z) < T for some T < +00 by 
lemma 1 of n° 19, (i). Therefore, the set C’ is contained in the complement of a 
Siegel domain. So it suffices to show that y(g) is integrable over the set of g = uhk 
where u stays in a compact set (since integration is mod I.) and where a(h) < T. 
As was seen above, 


dg =a(h)~‘dudhdk for g = uhk. 


Hence convergence reduces to that of the integral 


b 
/ au | e(kjak f a(h)*~ ‘dh 
a K a(h)<T 


where [a,b] is a compact interval of R. For h = diag(t,t~'), a(h)*~? = #67) and 
dh = d*t. Thus the condition Re(s) > 1 follows, ending the proof of the theorem. 


The latter applies to series with respect to the point at oo, but a similar series 
can be associated to any other parabolic cusp € = xoo of I’. It suffices to apply 
the formula to the group I’ = «~!I'x for which oo is a parabolic fixed point. This 
gives the series 


> at7'9)* = So a (27 *y29)' 


DEAT! Te\C 


To get a I’-invariant function, we replace g by x~'g, whence the series 


So a( x *yg)° = rice *sz)| )}-7* Im(yz)* = 
(21.22) - S = Im ( x yz)* 


where summation is mod I. By the previous theorem, the series associated to I”, 


with possibly one term removed, converges normally in every horocycle of P(r’). 
However, the parabolic fixed points of I” are the €’ such that «~'yxé’ = €’ for some 


y # +1. Hence these are the points x7 where 7 € PIL ), which, by the way, shows 


that P(r’) = aP(L). Since g ++ x ‘g transforms the horocycles of I” into those 
of I’, it follows that, for any parabolic fixed point 7 of I, series (22) converges 
normally in G(7) unless obviously 7 = o€ is equivalent to €, in which case the 
result continues to hold provided the term a(2~'og)* which increases indefinitely 
as z tends to 7 in the topology of horocycles is removed from the series. 

Series (22) depends on the choice of x such that € = xoo. But since x ++ xb 
multiplies a(x~'g) by a(b~') for all g, changing 2 multiplies (34) by a constant. 
A result independent of x is obtained by dividing all the terms of the series by 
a(x~')*, which leads to the series 
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Qa (xt) ~* s Qa (x~*yg)* = (e + d’)° S> Im (x *yz)" 


Dg\P 


where (c d) is the second row of the matrix 2+. For (c d) = (1 — €), (22) is the 
function 


(21.23) Me(z38) = So yz — €-?* Im(qz)° = STI [-1/(y2 - O°. 


De\P 


This obviously supposes € 4 co. As the transformation z ++ —1/(z—&) of G maps 
the point € to infinity, series (23) with respect to I’ and to € is clearly obtained by 
applying x to the analogous series with respect to I’ = x~'Ix and the point at 
infinity. 


We can now return to classical Eisenstein series 
Byoo(2) = So Jq32)" 
where summation is mod Ux or, up to a possible factor 2, mod Ix. As 
y? \F32)| =Im(qz)"”, 


comparison with the Maaf series is immediate: the series converges for r > 2. The 
Maaf series converges normally in all parabolic cusps with vertex € not equivalent 
to oo, so that 


y"!? Er,co(2) < M,-/2,00(2) 


is bounded and as a result parabolic at these points. On the contrary, in a cusp at 
a vertex € equivalent to oo, we have to remove the v < 2 equal terms if the series 
is not trivially zero, for which Usoy C Isc. So, for € = ov, 


y"? Erco(z) = vy"? +O(1) for large y, 


whence a Fourier series expansion of the form 


Exoo(z) =U t sy ane(nz/w) =v +O ae exp(—2ry/w)| : 


n>1 


The series is, therefore, holomorphic but not parabolic at oo. On this matter, recall 
[n° 19, (ii)] that a holomorphic form f at a fixed parabolic point € is parabolic at 
€ if and only if the function y"/?|f(z)|, and not f(z), is bounded in a horocycle 
centered at €. 

Denoting by 


a(g) = J(g;i)"a(g)””” 


the function of weight r on G associated to y"/? 


clearly becomes 


, the function associated to Ey,oo(z) 


Er,co(9) = >_ ar(y9), 


where summation is mod Uz. To associate an Eisenstein series to a parabolic fixed 
point € = x(00), we replace I by I’ = «~'Ix. This gives a series 
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E,c0(9) = >, ar(7'g) 


with respect to I’. In accordance with the basic principle of n° 19, (ii), the expected 
series E,,¢(g) is given by 


Thus it easily follows that 


E,e(z) = S- F(x *y2)" 


Ue\D 


Conclusions are obviously the same as earlier: if € is irregular and r odd, the series 
is identically zero; if € is regular or if r is even, then 


#0 if 7 is equivalent to €, 
Er.e(n) = 0 otherwise. 
If f is an entire automorphic form of weight r, f can therefore be made parabolic 
at € by subtracting a scalar multiple of E,,¢ from it. This operation does not alter 
f() at parabolic fixed points 7 not equivalent to €. Hence the next result which 
generalizes what we already know about the modular group: 


Theorem 32. Let I’ be a Fuchsian group. Every entire automorphic form of weight 
r > 2 is, in a unique way, the sum of a parabolic form and of a linear combination 
of Eisenstein series associated to parabolic fixed points of I’. 


For r even, the dimension of the vector space generated by the Eisenstein series 
is the number of classes of parabolic fixed points. For r odd, it is zero if —1 € I 
and equal to the number of regular fixed points otherwise. 


22 — Fourier Series Expansions 


(i) General method. The computation of the Fourier series of the functions E,(z) in 
n° 17, (iii) uses the series G,(z) and, on this account, cannot be applied to general 
discrete groups. Computing the Fourier coefficients of an automorphic function is 
a problem encountered in many other situations, either when the assumption of 
holomorphy is abandoned, or when more general groups J’ than the modular group 
are considered. To unify them, we consider G and a series of the form 


(22.1) Eo(9) = >_ 9(y9)- 


Summation is over I’ in the case of Poincaré series and of a function y integrable 
over G. In the case of a series associated to a parabolic fixed point which will be 
assumed to be oo, the function ¢ is left '.-invariant and summation is over I.\I. 
We will consider the second case since replacing y(g) by >> y(yg), where summation 
is over Ix0, reduces the first one to it, as was seen for Poincaré series. 

Expanding FE, in a a Fourier series in the neighbourhood of a parabolic fixed 
point € = s(oo) of I’ reduces this case to expanding the function 


Eo(s9) = 3° e(ysg) 


Doo \P 


at infinity. To this end, we group together the terms belonging to a same double 
coset IxocUg, hence of the form Pxoyy' with y' € Uz. For y¥',7" € Ug, 
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toW-1 


Door = Dory" => yo" * € Us NY Doo'y = Ue(y) - 


This relation is equivalent to Ue(y)y' = Ue(y)y". So one has to sum over the classes 
Ue()y', and so 


=> SS ¢v(w’'sg) 


Doo We Ug (y)\UE 


However, the 7’ € U¢(7) satisfy y'€ = € and y7'y~'o0o = ov, hence leave invariant € 
and y~1oo. Hence, since a parabolic matrix other than +1 has a unique fixed point, 


(22.2) Ue(y) = {e} if 1 Foo 


and summation over ¥’ is then extended to Ue. Therefore, 


(22.3) Ey(sg) = oS Soe (yy'sg) if € not equivalent to oo. 
Doo Weg Ug 


On the contrary, 
(22.4) Ue(y) =Ue if YE =O. 


In this case, one may assume s = y~'. Thus E,(sg) = E,(g). Then one might as 
well keep to the case € = oo. The relation yoo = oo is equivalent to y € ['x., which 
gives a term equal to y(qg) in the series E,(g). If y € [oc, then again Uxn(y) = {e} 
and summing over all 7’ € Use is necessary. So 


(22.3) Fo(9)=9(9)+ SY) Sov(’g) 
Don VWoo Uoco 
VEL oo 


The formula is the same as in the previous case except for a constant term. These 
formal calculations are no problem if the series E,,(g) converges unconditionally. 
In both cases, Ue = sU(wZ)s~+ for some w > 0, whence 


S58 (17's9) = 35 ¢ fysa(nw)g] 
Ue Z 
Introducing the Fourier transform 


(22.5) ar (g',g) = Je [g'x(u)g] e(—tu)du 
defined independently of all discrete groups for g’ € G — B, 
(22.6) ar [g',x(v)g] = at (9',g) e(tv). 

At least formally, the Poisson summation formula shows that 
(22.7) doe [ysx(nw)g] = "SF anjus(78,9) - 
It then follows that 


(22.8) E,(sg) = 6(€) + So Sow anu (18,9) 


TooyUe Z 
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where 6(€) = 1 or 0 according to whether € is or is not equivalent to oo, and where 
summation is over all cosets except the coset I. if € is equivalent to oo. Therefore, 
by (6), the expected Fourier series expansion is given by 


(22.9) Eg [sx(u)g] = y(sg) + Ss an(g)e(nu/w), 


(22.9’) an(g) = ae Ww" On/ (8,9) 
Doo We 


with the same restriction on the summation as above. 
These formal calculations remain to be justified. Supposing the series Ey(g) 
converges normally in every compact set, the series 


So vlysx(wu+wn)g] , 


n 


obtained by grouping the terms of the coset I[.yUz considered and by replacing 
g by x(wu)g, converges normally in all compact subsets of U, so that formula (5) 
is well-defined. To go from here to (7), it suffices to assume that the series on the 
right hand side of (7) converges absolutely.®° As going from (8) to (9) presumes a 
permutation of summations, the easiest is to assume that 


(22.10) pS s |an/w(s,9)| < +00. 


TooyU~ Z 


If y = f, for a Uso-invariant function f(z), then 


(22.11) Ey(g) =J(9;t) "Es (z) where E,.¢(z a J(y;2) "Ff (yz). 
To\r 


In this case, 


Thus, 


at (9',9) = J(954) aa eat! z+u) ' f [g'(z+u)] e(-tu)du = 
— J(g;i)" at (g’ ,2) ’ 
where we have set 
az (g',2) = fo (g;2+u) "f[g'(z+u)] e(-tu)du 


or, for g € G — B with second row (c d) and z € P, 
(22.12) Sele i (c¢ +d)" f(g¢)e(—tC)de . 
Im(¢)=y 


8 For a continuous function f on R such that the series > f(x + n) converges 
normally on every compact set, > f(a +n) = Yo f(n)e(nz) if 37 |f(n)| < +00 
because every periodic continuous function whose Fourier series converges abso- 
lutely is equal to its sum (Chap. VII, n° 6, theorem 2 or n° 27, theorem 24). 
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The integral does not depend on z, but on y if f is not holomorphic. Using (g) and 
(g'), and setting 


(22.13’) Qn(z) =w* oe An jw (YS, Z) = an(iy)e(na/w), 
Doo We 

finally, 

(22.13) J(s;z)”" Ep, ¢(sz) = O(€)f(sz) + Se Gn(z) . 


The general term on the right hand side is indeed invariant under z +> z+w, and 
so is the term f(sz) since sU(wZ) = U..s when € is equivalent to oo. If the function 
f is holomorphic and bounded on P, we will see that integral (12) is independent 
of y and zero for t < 0, the function a, /.,(7s, z) is proportional to e(nz/w) and 
(13”) reduces directly to a series of the form >.) dne(nz/w). 


(ii) The case of Poincaré-Eisenstein series. Let us consider the series 


Evp(z)= >> IQjiz)"elyzy?  (r > 2, p20). 
ENE 


This is series E,,¢ where f(z) = e(pz) is holomorphic and bounded on P. For 
g€G-B, 


(22.15) e(-t2)ar(g,2) = | (e¢ +4) "e[p.g(6) — teh dc. 


Integration is over the horizontal Im(¢) = y of P, but the result does not depend 
on y. |e(—t¢)| = exp(27tn) is bounded on all of the closed horizontal strip B C P 
of finite width, and so is f[g(¢)]. Thus the function being integrated is O(¢~") at 
infinity on B with r > 2. The result then follows from Cauchy. 

This being so, the Bruhat decomposition 


gz = (az +b)/(cz +d) = —1/c?(z +. d/c) +.a/c 


shows that 
(22.15’) e(—tz)ar(g,z) =c “e[(pa 4 td)/c] / ¢ "e (—p/c?¢ — tC) d¢. 


As p> 0 and —1/c*¢ € P, the function e(—p/c?¢) is bounded on P. So is e(—t¢) if 
t < 0. The function being integrated is then holomorphic and O(¢~") on Im(z) > 
T > 0, whence 


(22.16) ar(g,z)=0 fort <0 


again thanks to Cauchy. 
For t > 0, two cases need to be distinguished. 
(a) The case of Hisenstein series E,(z). As f(z) =1, 


e(—tz)ar(g, z) = c"ettd/e) f ¢re(—te)ac 


The function being integrated is O(¢~") at infinity on all of the half-plane Im(¢) < T 
and holomorphic for ¢ 4 0. Hence integration over the horizontal can be replaced 
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by integration counterclockwise around a circle centered at 0. The result is the 
product of —27i by the residue at ¢ = 0, namely 


(—27)" 
I(r) 


Taking into account the factors w, expansion (13) in the neighbourhood of the cusp 
€ = soo is therefore 


(22.17) e(—tz)ar(g, z) = “"e(td/c)t”'. 


(22.18) J(s;2)~"E,(sz) = 5(€) + S> ane(nz/w), 
n>0 
where 
) _ (—2n1)" -1 —r 
(22.18’) an = Lo RA e(nd/cw)c 


and where (c d) is the second row of the matrix ys, but not of 7. As 
Do Wes = DPxoysU (wZ) 


by definition of w, the coefficient c of ys only depends on the coset [4.yUe. The same 
holds for the exponential since multiplying ~ by an element of Ug = sU(wZ)s~ 
adds a multiple of cw to the coefficient d of ys. Finally, we sum over all cosets 
Toye if € is not equivalent to oo and all cosets other than Ij, if € = oo. 

Supposing € = ov, let us for example consider the Eisenstein series of the 
modular group. Summing over ¥y for which c > 0 is sufficient. The arithmetic nature 
of I’ plays a role here: 


Lemma 1 [I = SL2(Z)]. Let y and y' be two matrices of I. such that c > 0, 


ce > 0. Poo = Tny'Uco if and only if 
(22.19) c=c, d' =dmodc. 
Obviously, PxsxyUoo = Dvoy'Uco if and only if UssyUoo = Ucoy'Uco since c > 0, 


c’ > 0. On the other hand, 


imam =(9 T)(E a) o T)=( a) a7 


is equivalent to c’ = c,a’ =a+me, d’ = d+ m’c, hence proving the necessity of 
(19). Conversely, (19) implies ad = a’d’ = 1modc. Applying arguments to the group 
(Z/cZ)*, we see that a’ = amodc is equivalent to d’ = dmodc. Conditions (19) are 
therefore sufficient, qed. 


Hence 
Qn = (—2ni)"n"1/T(r) ye me > e(nd/c) = 


c>0 dmodc 
(c,d)=1 
- 
= (—2ni)"n" r)> Te( 
c>0 


where these are Gauss sums. Since it was shown in (9.34) that 


Ss De(n)c* = n*Sa5-1(n)/C(s) 


c>0 
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for Re(s) > 1, the sum of the previous series is equal to 


and we recover (17.25). 
(b) The case of Poincaré-Hisenstein series E,,)(z),p > 0. Now f(z) = e(pz) 
and (15’) includes an integral of the form 


(22.20) [erence — v/¢) dC 


taken along a horizontal Im(¢) = T of P, with u > 0 and vu > 0. To reduce it toa 
standard form, we analyze the function under the f sign on the set 


Im(¢)<T, |¢]2e@>0. 


je(—uC)| = exp(27un) < exp(27uT), and as |1/¢| < 1/0, the function being in- 
tegrated is O(¢~"). The integration horizontal can thus be replaced by a circle 


centered at 0 followed counterclockwise. Choosing the circle ¢ = (v/u)? exp(iy), 
d¢ = i¢dy and 


As a result, 
[erent = v/6) a6 = 
(22.21) = =i(v/uy' f exp [—4riA cosy + (1—1r)y] dy 


where integration is from 0 to 27 and where \ = (uv) 2. The simplest Bessel func- 
tions 


J (w) _ | aa _ Ferre = 


(22.22) = (w/2)" > (-—w*)" /2?"nl(n +k)! (k > 0) 


n>0 


occur and we find 
(22.23) / ¢-"e(—ud — v/C) dé = —a" (v/u)9-?”? J (40d). 


This being settled, since 


e(—tz)ax(g, 2) = o7"e [(pa + td) /o] i, (-e (—p/e¢ — 10) de, 


e(—tz)ar(g, z) = 
(22.24) = -i" (t/p)-Y/2e-* |e" 1e [(pa + td) /c] Ina (47 Vip/) 


Then the general formula 
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J(s;z) "Ey, (sz) = 0(E)f aides On /w (YS; 2) 
involves the exponentials 
e [(paw + nd) /cw] , 


where a, c and d are the coefficients of ys. Multiplying y on the left by a matrix 
x(m) € I's, and on the right by an element of Ue, hence of the form sx(m'w)s~! 
with m,m’ € Z, the matrix ys is multiplied on the left by z(m) and on the right 
by x(m’w). As 


1 m)\ fa b\ (1 m'w\ _ fat+me ? 
O 1l}/\ed/\O 1) c d+m'cw)’ 


the value of the exponential does not change, which forebodes well for the exactness 
of calculations. Moreover, the coset U.yU¢ is characterized by the value of c and 
the values of amodcZ and dmodcwZ. Then the cosets I.yU~ can be grouped 
together in (13’) according to the value of the coefficient c of ys. The sum 


paw + nd 
(22.25) PP e( = \= Kp,n (Uso We) 
dmod cwZ 


occurs as a factor. Thus 


(22.26) J(s8;z)”" Er,p(sz) = 6(€)e(p.sz) wS > ane(nz/w) , 
Gn = — ae 
(22.26’) x SE NORAD da [ar Vp/ne| 


Doo Weg 


It is not very easy. 
For I’ = SL2(Z) and € = co, s = e, the coefficients a, c and d are integers and 
lemma 1 shows that sums (25) reduce to Kloosterman sums 


Kyn(u,v) = oS e [(ux + vy)/n] 


x,ymodn 
xy=lmodn 


that occur elsewhere in number theory and that, for v = 0, reduce to Gauss sums 
I’. All this can already be found in Poincaré’s work of 1912. For given n # 0, the 
expression e(a/n) only depends on the class of xmodn and as a result defines a 
character x +—> e(x) of the additive group of the ring A = Z/nZ. Kloosterman 
sums only depend on classes of wu and v and are then written as 


Kau») = f e (ux + vr ') d*x, 


where integration is over the multiplicative group A* with respect to the invariant 
measure (weight 1 at each point) on this group. It may seem ridiculously scholarly, 
but shows that these sums are arithmetic analogues of classical Bessel integrals. 
Similarly, Gauss sums are arithmetic analogues of the I’ function. Adelic meth- 
ods would enormously simplify these calculations and enable us to deal with all 


434 XII — The Garden of Modular Delights 


congruence subgroups of SL2(Z) simultaneously. The same remark applies to the 
following. 


(iii) The case of Maap-Selberg forms ; analytic extensions. For the functions 


Moo(z;8)= 2 Im(yz)’ =¥° D2 


Poo\P 


r =O and f(z) = Im(z)°. So now, setting 
(22.27) W.(t) = iE (1+u?) * e(—tu)du = Ws ((¢)) 
R 


for Re(s) > 3, t € R, an unorthodox but convenient notation, we get 


wee pm eral ede 
=y° if [(ca + cu +d)? +. c7y*] * e(—tu)du = 
=y" *|el**e(td/c)W.(ty)e(tz) . 
Exercise 2. Show that 
So y*/lz+ nl? =y"* S° Wz (nly) e(nz) 


pour Im(z) > 0, Re(s) > 1. 
Exercise 3. As usual set a(g) = y for z = gi. Show that 


fe [wa(u)g]* e(—tu)du = y°W.(ty)e(tz) . 
The coefficients of the Fourier series 


(22.28) Mxo(xz; 8) = So an(y; s)e(nx/w) 


at a parabolic cusp®° € = xoo, are of the form 


(22.29) ao(y; 8) = 5(E)y* + a0(s)We(0)y'*, 
(22.29 an(y; 8) = an(s)Ws (nly /w) (n #0). 
The first term y* on the right hand side of (29) is the contribution of the coset 


I'5.U¢ in the case where € is equivalent to oo. The numerical coefficients an(s) are 
the series 


(22.30) an(s)= > ele (nd/ew) , 


Doo We 


where (c d) is the second row of ya and where the sum is subject to the usual 
restrictions. 
In the case of the modular group, lemma 1 shows that 


86 Change s to x so as to avoid confusion with the variable s. 2 was changed to s 
above in order not to have to write expressions such as x2(t). 
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=o $2 e(nd/e) = 01-25 (|nl) /¢(2s) 


c>0 dmodc 
(c,d)=1 


ifn #0. For n = 0, I’.(0) = Card[(Z/cZ)*] = y(c), i.e. the Euler indicator (number 
of classes). Thus 


=~ ylc)e** = ¢(28 — 1)/¢(2s) 


by (9.33). Hence, the “constant” term, i.e. independent of x, of the Fourier series 
of 


s) = So Im(y2)’ = Mao), 
is 
(22.31) (28)ao(y; 8) = C(2s)y° + 6(28 — 1) Wa(O)y!* 
with (see further down) 


— 1/2)/P(s) = 


(22.32) =n" 2) Pr (5 —1/2)/m°T(s). 
Introducing the function €(s) = 7~*/?I'(s/2)¢(s), it follows that 


€(28)M(z; 8) = €(2s)y° + - = sae ce 


(22.33) +n °I(s)y'* S~ o4_25(n)W; (nly) e(na) 
nZ0 

(22.34) €(2s)M(z;s) = a0(y3s) +) an(yss 

with 

(22.35) ao(y; 8) = E(28)y* + €(2 — 28)y'*, 

(22.35’) On(Y; 8) = o1-25(n)m *I(s)y' *We (\nly) (n £0). 


The term ao(y;s) being clearly invariant under s +> 1 — s, conjecturing that the 
same holds for an(y;s) makes sense. This requires a simple relation between Ws 
and Wj-_s. Integrals (27) have been the focus of extensive studies, but it is best to 
follow Jacquet and Langlands.®” We start by writing that 


nm *IP(s) (uw? +1) = ye exp [—mt? (u? +1)] *d"t for Re(s) > 0. 
So 
5m T(s)Wely) = / e(uy)du if exp [—mt? (u? + 1)] #*d*t = 

= [exw [-7 (y?/t? +0?) Po *d"t 


87 Automorphic Forms on GL(2) (Springer, Lecture Notes 114, 1970). 
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for all y € R. This leads to (32) for y = 0 and, for y > 0, 


(22.36) 5m T(s) Waly) = yo? exw [-my (7 +477)] 787 at. 


Introducing Bessel functions 


+00 
Ki(y) = a exp [—y (t+t7*) /2] t’d*t, 


defined for all vy € C and y > 0, (35’) can be written as the Maa form 
(22.35”) on(y;8) = 2|n|* 2 01-26(n)y? K,_1 (2n|n|y) (nm #0), 


but it is best to preserve the W;, especially so as not to fall under the temptation 
of becoming absorbed in the contemplation of the 12.345 formulas obtained by 
specialists of Bessel functions who have never come across SL2(R) and omit the 
factor 27 in the exponentials defining the Fourier transforms. 

This calculation is justified provided 


f? Re(s) exp (—7t?) a't [ exp (—at?u7) du < +00 ’ 


hence for Re(s) > 1/2. But integral (36) converges for all s since, at infinity or 0, 
the exponential under the f sign tends to 0 more rapidly than any power of t. As 
a result, 


n*I'(s)We(y)y? > 


is an entire function of s invariant under s + 1—s. As 1/I(s) is an entire 
function, so is W,(y) for all y > 0. Observing that 


Ino" o1~25(n) = oa did; * 


dj dg=|n| 


is invariant under s ++ 1 — s, we readily see that, for n 4 0, an(y; 8) is an entire 
function of s invariant under s+ 1-<s. 

To deduce a similar result for the function €(2s)M(z; s), it suffices to show that, 
for all y > 0, the series )> an(y; s)e(naz) converges normally in all compact subsets 
of the plane of the variable s. Now, 


5m T(s)We(y) - exp(—2ry)y* 2 [exw [-ry (t- ey eas ie 


The exponential is a decreasing function of y, while, on the strip 1—o < Re(s) < o, 
the function |t?*—'| remains between its values at o@ and 1 —. So in a set such as 


(22.37) y>T>0, 1-a0<Re(s) <o, 


there is an upper bound of the form 


(22.38) 5 °T(s)Waly)| < (Ts) exp(—2my)y"—3 , 


where, using (35’) and a trivial upper bound of o1~2s(n), 
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lon(yss)| = 20 (n™ exp (—2n|n|y)) . 


The precise value of the exponent N does not matter much since the result obtained 
suffices to prove normal convergence in (37) of the Fourier series of €(2s)M(z; s) 
with its “constant” term 


(22.39) £(2s)y* + £(2 — 28)y!* = €(25)y* + (28 — 1y'* 

removed. Its singularities are [n° 2, theorem 1] those of the function 
ae IN i ied, 1 

Gee) s (GH =) ee (ss =! 


The two simple poles at s = 4 cancel out and the residues at 1 and 0 are 4 and 


—t. In conclusion: 


Theorem 33 [[’ = SL2(Z), MaaB]. The function €(2s)M(z;s) extends analyti- 
cally to a meromorphic function of s invariant under s —+ 1—s and whose only 
singularities are simple poles at s = 1 and s =0, where the residues are equal to 5 
and —3. 

2 


Since 1/€(2s) is zero at s = 0, this point is not a pole of M(z;s). All the zeros of 
£(2s) being located in Re(s) < 4, the only pole of M(z;s) in Re(s) > 4 is s = 1. 
As 
(22.41) M(z;8) = y* + €(28 — L)y'* /€(28) + o(% 8) /€(28) 
and as 
(238) = D> an(y; s)e(nx) 
nA0 


is an entire function of s, the residue of M(z;s) at s = 1 can be calculated by 
writing that 


€(2s — 1)/€(2s) = [1/2(s — 1) +at...] / [€(2) + 2€'(2)(s -—1) +... J = 
= 1/2€(2)(s-—1)+ b+... 


where b = a£(2) + €’(2), whence 


lim(s — 1)M(z; s) = 1/2€(2) = 3/7. 


Moreover, 
lim M(z; s) — 3/m(s — 1) = y + y(z;1)/€(2)+ 
+ lim [€(2s — 1)y'~* /(2s) — 1/2€(2)(s — 1)] . 
However, 
&(2s — 1)y'~* /€(2s) = [1/2€(2)(s —1) +64 ...J[1-—(s—l1)logy+..J= 


= 1/2€(2)(s —1) +b— logy/2€(2) +.... 


As a consequence, 


lim | M(z;s) as 1) =ytb * hogy aay nl Lje(nz) , 


s=l1 
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a I’-invariant result. 


Similarly, in the neighbourhood of s = i, 


€(28 — 1)/é(2s) = ae we + Sey Ose eS, 


“2 1 1 
y" = y?[1 6 s)logy+...], 


So 


y? + €(28 — Ly'* /€(2s) = (28 — ly? logy +... 
and 


M(z;s) = (2s —1l)y? logy+...+[(2s—1)+..]] [o(a 5) +--]. 


Hence lim M(z;s) = 0 and 


lim M(z;s)/(2s—1)= y? logy + 3 On (y; sere) 7 
22 nA#~0 


Like the Maa® series, this function is J’-invariant. 
Exercise 4. Set a(s) = Ws(0)¢(2s — 1)/¢(2s). Show that 


M(z;8) — [y*+a(s)y'*] =O ly? exp(—27y)| 


for large y. 
Exercise 5. Let 


f(z) = DJ an(y)e(nz) 


be an invariant function under the modular group and O(y~™) at infinite for all 
N. Calculating the inner product (on ['\G) of f and M(z;s), show that the Mellin 
transform of the function ao(y) can be analytically extended to a functional equa- 
tion ins+>1-—s. 


A far simpler method for obtaining a more general result than the previous 
theorem consists in associating the series 


Or(t;9)= D> F [tg (6) =FO)+ So SOF [nt.g7*y* (e1)] 
€ 


EZ? Uso \F n>0 
to every function F on the Schwartz space S(R”), where e1 is the first basis vector 
of R?. G acts linearly, transforms e1 into primitive vectors of the lattice Z? and 


Us is the stabilizer of e:. Convergence is obvious for t £ 0. Defining the Fourier 
transform by 


(22.42) F(a,y) = [rt vye(yu — xv)dudv 


and using the Poisson summation formula in R?, we get 
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Or (t-';9) =tOp(t;g).- 


Thus the series behaves asymptotically in the neighbourhood of t = 0 since this is 
obvious for large t. Setting 


0% = 0p — F(0), 
Tr(g;s) = [fF (t.g-'e1) t*d*t for Re(s) >0, 


the method used for the Riemann function then leads immediately to the analytic 
extension of the Mellin transform 


(2243) Mr(g:s)= f op(tig)ta"t = C28) > Pe(a9:2s). 
Uso\P 


It has at most simple poles at 1 and 0 and 


(22.44) Ma(g;1— 8) = Mr(g;s). 


Supposing that F[k(a)] = vr(k)F(«) for some character K, I'r(g; 2s) is [n° 15, (i), 
theorem 13] the function of weight r associated to 


f(z) = J(g;1)" Tr (9; 28) . 


ap te 
& y ig | = jr (ty Fer) t?°d*t = Dr(e;2s)y* 
y 2 


implies that 
f(z) =Tr(e28)y°"”, 


Mr(g;s) is then, up to the factor Ir(e; s)¢(2s), the function M,(g; s) of weight r 
associated to the series 


(22.45) M,(z;8) = 5 “(cz +d)~" Im(yz)°""”, 
which is obviously zero for r odd. Choosing 


(u + iv)” exp [—7 (u? + v’)] ifr >0, 


22.46 F(u,v) = 
( ) 2) (u—iv)-"exp[—m(u?+v7)] if r <0, 


we find 
1 
I'r(e;2s) = ea exp (—7t?) ?*d*t = Sm \r|/2) . 


On the other hand, the general relation 


(d/du + id/dv) ‘| F(a, y)e(yu — xv)drdy = 


= an | (w + iy) F(a, y)e(yu — xv)dady 
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shows that Fourier transforms (42) of functions (46) are given by P= (-1)’F. In 
conclusion, for r even, the function 


gn Stl"l/2 P(g — |r|/2) C(28)M,(z; 8) 


extends analytically and is invariant under s+ 1-—s. 

We leave it to the reader to fill in the details and to generalize exercise 5 to this 
case by assuming f to be of weight r. In particular, series (45) are orthogonal to 
parabolic forms of weight r, as well as to their conjugates. This method can also be 
used to compute the Fourier series of M,.(z;s) and more generally of Mr(g; s) for 
all F € S(R*). 

Here too, adelic calculations, inaugurated by John Tate in his thesis on zeta 
functions are far simpler and enable us not only to generalize results to congruence 
groups, but also to replace Q by an arbitrary algebraic number field.** The far 
harder analytic extension of Maaf-Selberg series for an arbitrary Fuchsian group 
was announced by A. Selberg in 1956 in a famous article whose proofs were put off 
till the recent publication of his complete works. But other authors have dealt with 
the subject.®° 


88 R. Godement, Analyse spectrale des fonctions modulaires (Séminaire Bourbaki, 
1964/65), I.M. Gelfand, M. Graev and I. Piateski-Shapiro, Representation Theory 
and Automorphic Functions (trad. Saunders, 1968, or Academic Press, 1990), 
S. Gelbart, Automorphic Forms on Adele Groups (Princeton UP, 1975). 

89 §. Lang, SZ2(R), who presents Fadeev’s the method, Peter Lax & Ralph 
S. Philips, Scattering Theory for Automorphic Functions (Annals of Math. Stud- 
ies, 1976) T. Kubota, Elementary Theory of Eisenstein Series (Tokyo, Kodansha, 
1973), A. Venkov, Spectral Theory of Automorphic Functions (Kluwer, 1990), 
A. Borel, Automorphic Forms on S'L(2,R) (Cambridge UP, 1998). And all this 
was generalized to general semisimple groups by R. P. Langlands at the start of 
the 1960s (Springer Lecture Notes, 544, 1976). 
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23 — Modular Forms and Dirichlet Series 


(i) Hecke series. If 


(23.1) f(z) = ane(nz) 
is an entire modular form of weight r, the function 


9(y) = fliy) — a0 = S> Gn exp(—27ny) 


n>1 
decreases rapidly as y tends to +00. Since f(—1/z) = z" f(z), 
g(y~*) = (iy)"9(y) + aoliy)” — a0, 
and so g(y) = ao(iy)~" — ao + O(y) in the neighbourhood of y = 0 for all N. 


Therefore, the Mellin transform 


+00 
(23.2) Agoy= / ay —aslaeey 


0 


falls within the compass of the general theorems of Chap. VIII, n° 12: integral (2) 
converges for Re(s) > r, extends analytically to a meromorphic function on C and 
its only singularities are simple poles at s = r and s = 0, with 


(23.3) Res, (Ay) = aot”, Reso (Af) = —ao. 


Besides, the arguments used for the zeta function apply here in too obvious a 
manner for there to be any need to give details: introducing the entire function 


+00 
(23.4) At (s) = i alwy’d’y, 


(23.5) Ay(s) = A} (s) i Aj (r 8) — ao [1/s +2" /(r — s)] 


for Re(s) > r, for all s. Hence the functional equation 
Aj(r — 8) = 4" Aj(s) 


and the poles and residues of Af. 
To compute Ay(s) in terms of the coefficients an, we first write formally that 


(23.6) A-(s) = pS an fy" exp(—2mny)d*y = (27) *I'(s) s an/n>, 


where summation is over n > 1. To justify this computation it suffices to check that 


ye / lany® exp(—27ny)| d*y < +00, 


n>1 


which reduces to checking that the series }> an /n° converges absolutely. For this, we 
need information about the order of magnitude of the an. Now, as was seen (n° 17, 
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theorem 16) earlier, an = O(n"/”) if f is parabolic. In this case, the calculation is 
justified for Re(s) > 1+ 7/2, and even for Re(s) > 5 +7/2 using Deligne’s results. 
If f is not parabolic, we subtract f(oo)E,(z) from it to make it parabolic. Since, 
for E,(z), Gn is proportional to o,—1(n), it suffices to analyze the corresponding 
series, namely 


Ds os d’* /n? 


n>1 d\n 


up to a constant factor. Setting n = md, reduces the question to the double series 
Dod / (md)? = ST 1am? = (sr + 1)6(s), 


which converges for Re(s) > r. The associativity theorem for unconditional conver- 
gence (Chap. II, §2, n° 18, theorem 13) then justifies (6). In conclusion: 


Theorem 34. Let f(z) = >> ane(nz) be an entire (resp. parabolic) modular form 
of weight r. Then the Dirichlet series 


(23.7) Ls(s) = — an/n® 


converges for Re(s) > r (resp. Re(s) > (r +1)/2). The function 
(23.8) Ag(s) = (2m)-*T(s)L4(s) 


extends analytically to a meromorphic (resp. entire) function on C whose only sin- 
gularities are simple poles at 0 and r, where its residues equal i’ao and —ao. It 
satisfies the functional equation 


(23.9) Ag(r —s) =i" Af(s). 


For example, if one considers the parabolic form 
A(z) = So r(nje(nz), 


one notices that the series }*7(n)/n* converges for Re(s) > 13/2, extends to an 
entire function and satisfies a functional equation in s + 12 — s. These results 
were conjectured by Ramanujan in 1916 and proved by Mordell in 1917, except 
obviously for the abscissa of convergence which is based on rather difficult Deligne 
theory, and for Ramanujan’s conjecture according to which non-trivial zeros of the 
Dirichlet series are located on the “critical line” Re(s) = 6. It will be seen later 
that, besides, the series }> 7(n)/n° has an infinite product expansion. 


(ii) Weil series. All of the above is in Hecke (Math. Annalen, 114, 1937) ; André 
Weil (Math. Annalen, 168, 1967) discovered that it can be generalized to series 


(23.10) Ly(s:x) = Ss anx(n)/n® 


where x is a character modulo an arbitrary integer m (§3, n° 9). Calculations 
resemble, and for good reason, those developed in n° 10 to obtain the functional 
equation of the Dirichlet series L(s; x). 

In fact, Weil’s calculations are to a large extent valid for all solutions, not just 
holomorphic ones, of the functional equation f(yz) = J(y;z)" f(z). We set 
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(23.11) Flax)= Yo x(afl(@+a)/m] = fe +m:x). 
cases 


Recall that x(a) = 0 if a is not coprime to m. As every so often, it will be useful 
to only use left invariance of the function 


f(g) = J(g,t) "F(2) (2 = gi). 
Using notations (15.16) and (15.18), the translations z +> z+ u are defined 
by matrices z(u) € U and the homothety z +> z/m by the diagonal matrix?’ 
h (m-3), and J[h(t)a(u),2] =t7' for all u, t and z. As a result, 


Relation (11), multiplied by J(g,7)~”, becomes 
1 


(23.12) Igy’) Flix) =m"? S* x(a) fr [hb (m™#) w(a)g] 


amodm 
(a,m)=1 


The factor m~"/? being well-defined only for functions of weight r on G, we are led 
to more generally set 


(23.12) eX = D2 (ay [A (m=?) w(a)g] 


(a,m)=1 


for all left I’-invariant functions y on G, for the moment without any other as- 
sumption. Using the Fourier series expansion 


(23.13) plug) = a an(g)e(u)”, weU, 


whose coefficients 


(23.14) an(g) -/ p(ug)e(u) 
satisfy 

(23.15) an(ug) = e(u)"an(g) , 
we get 


°° Abhorring useless square roots, conservative arithmeticians prefer to consider 
the group GL2(R) of 2 x 2 matrices with determinant > 0, i.e. the direct product 
of G and of R4 ; see the beginning of the next n°. 
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(GX) = D> x(a) Doan [a (m™?) w(a)g] = 


Hence, introducing the Gauss sums 


(23.16) Im(n,x) = > x(a)e(na/m) 


amodm 
(a,m)=1 


of n° 9, (v), 


—1 
9(9;x) = _ Iin(n, x)an ln (m :) al 
nA0 
expression from which ao has disappeared because )> x(a) = 0 if we assume x is 


not the unit character and, a fortiori, as we will do to simplify the presentation, 
that y is a proper character modm. Theorem 5 of n° 9 then shows that 


(23.17) (95x) = Pm(x) Dan [h (m=) g] x(n) 


where Iin(x) = Im(1, x) = 2 x(a)e(a/m) satisfies | Pin (x)| = m2 by (10.6’). 
When y(g) arises from an entire modular form f(z) = }> ane(nz) of weight r, 


an(g) aa JI(g,t) “ane(nz) 2. Z= gi, 


and so 


Since (9; x) = J(g,i)-"m"!? f(z; x), 


(23.17’) F(X) = Tin(x) DS anx(nje(nz/m), 


n>1 


a result that could have been obtained by a direct calculation starting from (11). 
As ao disappears from the result, f(z;x) is an entire series in e(z/m) without a 
constant term, hence decreasing exponentially as Im(z) tends to +00. 

Let us now show that, for any J’-invariant function y(g), 


(23.18) (wg; x) = x(-De (95x) - 


The left hand side of (18) is obtained by replacing g with wg in (12). Denoting by 
a’ a general solution of aa’ = 1modm, the right hand side of (18) is obtained by 
replacing x(a) with x(—a’) in definition (12’) of y(g; x) or, what is equivalent since 
a+—+ —a’ permutes the summation, by replacing a with —a’ in the term x(a/m). 
Hence to prove (18), it suffices to show that the y takes the same value at points 
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h m-4) x(a)wg and h (m-3) x(—a’)g of G. The only way to do this is to check 
that, for all a and g, 
h (m-?) x(ajwg EIA (m-?) z(—a’) g. 


However, this means that the entries of the matrix 


(m4) atau (a)n(m-l) = (fe dim) 


are integers, which is clear. 
When ¢y = f, is associated to a generalized modular form f(z) of weight r, 
going from g to wg transforms z into —1/z. As 


J(wg,t) = I(w, z)J(g,t) = J(g, tz, 
(18) becomes 
(23:18) F(-V/ax) = x(-V)2"F (8X) - 


If the right hand side decreases exponentially to 0 as y tends to +00, this relation 

shows that so does the left hand side and hence the right hand side as y tends to 0. 

This is the case of entire modular forms and of Maaf8 series since going from f(z) to 

f(z x) removes the constant term of polynomial growth from their Fourier series. 
This being done, we associate the Mellin transform 


(23.19) Meo / o(hg;x)a(h)*dh = Ag(—g; 8,%) 


to y, where a(h) = Im(hi) = t? for h = h(t). Convergence for all s is obvious if y = 
fr for some entire modular form of weight r or a Maaf series (21.17) or if, trivially, 
it is continuous and with compact support mod I’ for then a continuous function 
of h with compact support is being integrated. In fact, the correct assumption to 
use is y(g) = Ola(g)*]. So using (18), we find 


Agteeee = / y (hg;X) a(h)*dh = x(-1) i v(whg; x)a(h)*dh = 
= x(-1) f (hg: xda(h) “ah 


(23.20) Ag (93 8,X) = x(-1) Ay(wg; —s, x) - 


To obtain Weil’s result, we apply (20) for g = e to a function 
9(9) = JG, 4)" F(2) = Jg, i)” SS ane(nz) 
of weight r. For h = h(t), 
yh; x) = J(h, i) "m"” fliysx) = mt f (t7i;X) 


by (12) and (12’), and setting by definition, 


+oo 
As(83x) = i fliy xy dy 
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for every generalized modular form f(z), for example entire or Maa, for which the 
integral converges, we get 


Ay (e.8,x) =m"? [af (Bix) Pat = 
1 r/2 - a) str/2 7% 1 r/2 aaa 
= ait Fliysxyy dy = 5m" As (8 + 17/2;X) « 
On the right hand side of (20), we need to compute 
Ay(wi—s,x) = f(b; xJa(h)*dh 


where 


mm"? o(hw;x)=m"/? I(hw, i)" f(hwisx) =i "t f(hisx)=i"y"” figs x) - 


Hence 
m~"!? As(w;—8, x) = mfr vy od*y = 
=i" / Ff (iy; yl? sd*y =i "Af(r/2—5,x). 


Relation (20) then becomes 
Ay (8-+1/2,) = x(-1)Ap(r/2— 8,x) 


or, in Weil’s form, 


(23.20’) Ay (r — 8,X) = x(-1Ay(s, x) 
Here too, (20’) can be easily deduced from (18’). 
As 
| flivoou'd'y = En = () Dan [xe (—2nny/m) y°d*y = 


= Pin (x) 2m/m)~* P(8) So anxtnyn™ 


where summation is over n > 0, the usual arguments together with (14) lead to the 
final result: 


Theorem 35 (A. Weil). Let f(z) = 3°.) ane(nz) be an entire modular form of 
weight r and y a proper character modulo m > 1. The function 


flaxy= SO x(a)f le +a)/m| 
amodm 
(a,m)=1 
satisfies functional equations 


fle+mx) = flax), f(-1/25x) = x(-1e2" f(z x)- 
The integral 


Az(8;x) = [ fou = 


(23.21) = Im (X) (2n/m)~*I(s) S> anx(n) 


n>1 
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converges for all s € C, is an entire function and 
(23.22) Ay(r — 85x) = #"x(—1) A; (8; X) - 


Exercise. Write down (21) explicitly when f is an Eisenstein series. 

Weil’s far more complete results are mostly about congruence groups and admit 
a converse which says that if the series }>anx(n)/n*, with given an = O(n”), 
satisfy (22) for “sufficiently many” characters x, then f(z) = )>ane(nz) is a 
modular form?! for a congruence group. Weil’s aim was to show that zeta functions 
of elliptic curves over Q are obtained in this manner using modular forms with 
respect to congruence subgroups. Once you get on this slippery slope, you run the 
risk of going, like Deligne, Serre and others, as far as the now proved Taniyama, 
Shimura and Weil conjectures, Fermat’s theorem, etc. Mercy! 


(iii) Generalization to non-holomorphic forms. Apart from functional equa- 
tion (22), we would like to put A,(g;s;x) in the form of a Dirichlet series for 
an “arbitrary” function y on I’\G as we did when ¢ arises from an entire modular 
form. This is possible using an assumption known to hold in the case of Maaf- 
Selberg series. 


Writing the Fourier series expansion 


(23.23) (ug) = 5 an(g)e(nu) 


of ~ gives 
(ug x) = Fin(x) D> an [bh (m=?) e(u)g] x) = 
= Fin(x) Yan [2(u/m)h (m-*) g] x(H) = 
= Fin(x) San [h (m=?) g] x(r)e(nu/m) 
Thus 


= m'Fon(x) SOX) fan (hthg) ae. 


When y(g) = J(g,i)-"f(z) = J(g,t)~" Xo, e(nz) arises from an entire modular 
form f of weight r, 


(23.24) an(g) = J(g,t) “Gn exp(—27ny)e(na) . 
So 


°l A. Weil, Uber die Bestimmung Dirichletscher Reihen durch Funktionalgleichun- 
gen (Math. Annalen, 168, 1967, pp. 255-261). See also Andrew Ogg, Modular 
Forms and Dirichlet Series (Benjamin, 1969) and Serge Lang, Modular Forms 
(Springer, 1976). All this has been fully revolutionized in an adelic framework by 
Hervé Jacquet and R. P. Langlands, Automorphic Forms on GL(2) (Springer, 
1970, Lecture Notes n° 114, 548pp.) and Hervé Jacquet, Automorphic Forms 
on GL(2), Part II (Lecture Notes n° 278, 1972). 
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an [h(t)g] = J(g,t) “ante (nt? z) 


and 


fo [A(t)g] t°d*t = J(g,i)” if ane (nt?z) Pt d*t. 


i exp(—wt)t°d*t =I'(s)w ° for Re(w) > 0, 
0 
and equality showed in Chap. VIII, n° 10, it follows that 
fo [A(t)g] #25a"t = 539.8) "onl (8 +1/2) (—2minz)77/2-8 
In particular, for g = e and z =i, 
fo [a(t)] 2d"t = sian (8 +r/2) (Qnn)77/2-5 = 
= si On) or (s+1r/2) Gagne : 


whence a Dirichlet series in accordance with the previous theorem. 

In the case of a function y(g) = J(g,i) "f(z) = >> an(g) associated to a not 
necessarily holomorphic function f, the previous calculation again leads to a Dirich- 
let series if, instead of (24), we have a relation of the form 


an(g) = J(g,4)~" anW (2n|nly) exp(2rina) 
i.e. 


(23.25) f(z)= ‘2 anW (27|nl|y) e(na) , 


with constants an and a function W(2z|n|y)e(nz) replacing the function e(nz) = 
exp(—27ny)e(nz) of (24) for n > 0, as in the case of the Maa series. Indeed 
relation (25) shows that 


ic [A(t)g] t°d*t = [4 i) "t"anW (27|n|t?y) e (nt?) e*d*t = 


= 55(9,8)-"an|n|°"/2 (an) 20? 
x / W (ty)e(ta)t°t"/2a"t 
and in particular that for g = e, 
(23.26) / an (R(t)] P°d"t = 5 (2m) Fw (9+ 7/2) an|n| 2"? 


where 


(23.27) Pits / W(tyed't 
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is the Mellin transform of W. 
In the case of the Maa series Moo(z;u), for u > $5 the function 


W(t) = W.(t) =fo a?) ~" e(—te)de 


is the Fourier transform of (1 + 2”)~“. By the change of variable 1+ 2 = 1/y, its 
Mellin transform is 


+00 a 1 +00 
/ (1+2°) “a°d*s = = / (lta) "a8? d*a = 
0 2 Jo 


1 
as =f  elicame G — y)/ dy = 
0 


= ar (u— 8/2) P'(s/2) /T'(u). 
The general relation 
n°? P(s/2)Ps(1—s) = 2-9-9? 7 [1 — 8)/2] P5(s) 
of n° 1 of this chapter, applied to f(x) = (1+ 27)~“, then shows that? 
n~-8)/271(1 — s)/2] Pw(s) = 
= 4479/2 7(s/2)T [u— (1— 8)/2] I [(1 — s)/2] /F'(u). 
Thus 


Tw(s)=72—°P (u- +=) r(5)/rw) if W = Wy. 


This relation makes it possible to compute the series L+(s; x) corresponding to 
MaaB8 functions f(z) = M(z;u). The result is not of much interest since, as was 
seen in n° 22, eq. (35’), in this case 


E(2u)an(y;u) = 1-2u(n)m“P(u)y’ “Wa (|nly) - 
From what precedes, this is a function whose Mellin transform at y is 
O1—2u(n)a~“P(u)|n|“*'Dw(s +1—u). 


We, therefore, obtain, up to factors I’, the series > o1~2u(n)/|n|*t'~", whose func- 
tional equation is obvious. But as an exercise, the reader can give the details of all 
the computations in this case. 

The existence of a Fourier series like (25) means that the integral 


iz f(x + iy)e(nx)dx 
«amodZ 


only depends on the product |n|y. If W¢(27y) denotes its value for n = 1, then 


1 
| f(a + ty)e(—nx)dx = Wy (27|nly) . 
) 
This property is related to Jacquet-Langlands “ Whittaker models”. 


°? Restrictions obviously need to be imposed on s and u to ensure that the integrals 
converge. 
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24 — Hecke Operators 


(i) Operators T(x) for an abstract group. If a function y(g) on a group G is invariant 
under a subgroup I of G, it is no longer in general the case for its left translations 
under L(x)y : g > v(a~'g). Indeed 


L(y) L(a)p = L(a)L (a "y2) 9, 
so that invariance of L(x)y under y presupposes «ya € I’ and so 
y¥€aPe 'nr=T(e). 
Hence only 
(24.1) ye T(x) => Lys)p = L(x)p 


can be guaranteed. Article 1 of group theory rules having been recalled, let us recall 
article 2 (Poisson-Eisenstein law): to make L(x)y I’-invariant, replacing it by the 
sum 


(24.2) T(2)p => Lyya)e 


extended to y € I’. But nothing ensures the convergence of the series, everything 
even shows that in general it diverges. Formula (2) is nonetheless well-defined if we 
only consider matrices x for which series (2) can be replaced by a finite sum. Since, 
by (1), L(yx)y only depends on the coset yI(x), it therefore suffices to assume 
that there finitely many such cosets in I’. In fact, as will be seen, it is better to 
assume that 


(24.3) I'(a) and (a ') have finite index in I 

if we want results. We then also have 

(24.3’) [2Pa :I'(x)] < +00, [x Pz ae (x~*)] < +00 

because the automorphism g +> zgx~' maps I onto 4I'x~' and I'(a~') onto 
aD (2-*)a7 =a(2"henr)e * =Inalx' =T(2). 

From now on 

(24.4) C(a)=La "Pr. 


The group I acts on the right and on the left on C(x), so there is a quotient set 
I’\C(a), which is also the quotient of 2~'I by the equivalence relation P'a~!y = 
Ix ~+y'. However this is equivalent to 2~!y'y~'a € T, ie. toyyt € alae Ar = 


I(x). Hence a bijection 
(24.5) I(2)\\l — I\C(2) 


transforming the coset I’(x)y into the coset [a7 y. 
We will denote by I the set of x satisfying (3). These are the elements of G 
for which operators 


(24.6) T(x)p= S> L(ya)y 


yD (2) 
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can be defined or, more explicitly, using map (5), 


(24.6) — T(a)o(g)= D> v(e@ ty") = SS ele 9) = D2 vlug). 


yT (x) D(x)y I\C() 


A first trivial property of operators T(x) is that they commute with right trans- 
lations, which act on left T-invariant functions. Another is that T(x) only depends 
on the double coset C(x) = 'x~'T as shown by the latter sum (6’). Others will be 
useful for the modular group, but can be proved in all generality, which avoids ques- 
tions unrelated to the problem or restrictions to operators associated to elements 
x € I’ that are too specific to make calculations comprehensible.°* 


Lemma 1. I” is a subgroup of G. 
Clearly, x € I’ implies x~' € I’. On the other hand, 


P(ey) = ayy a NP =a (yy nes) 2. 


But if (x) has finite index in I’, y'(x)y~* has finite index in yl'y~*, so PNyI'(a)y 
is of finite index in PN yIy~' = I'(y). However, 


Ponyl(«)y' = Pny(£nalz*) y= 


1 1 


=Pnyly' nytla ty =T(y) nyse ty’. 


As this group is contained in I’, it is equal to 
PY) Nyala ty =Py) NT (y2), 


which is of finite index in I'(y), hence in I’. As it contains I'(yx), it follows that 
I'(yx) is nenecessarily of finite index in I’. So yz satisfies the first condition (3), 
and so does xy, thus also xy—!, ged. 


Lemma 2. For all x,y € I", the operator T(x)T(y) is a linear combination with 
integer coefficients > 0 of T(z),z€ I". 


By (5), 
T(2)Ty)e(g)= S> Tew)elug)= S> dS) ylurg). 


I\C(a) I'\C(«) P\C(2) 


The result is of the form >> c(x, y; w)y(wg) with integers c(x, y; w) not depending on 
y for any w € I’. To calculate them, we count the ordered pairs (u,v) € C(x) x C(y) 
such that uv = w taking into account the fact that only the cosets Tu and Iv 
matter. Since replacing u by yu replaces w by yw, the product wv = w only occurs 
via the coset [’w, which means that 


(24.7) c(x, y; yw) = C(x, 5 w) - 
The substance of the proof consists in showing that we also have 
(24.7’) C(x, y; wy) = C(x, Y; w) - 


°3 In 2000 or thereabout, some authors still for example, replace summation over 
I'(x)\I° by summation over a “system of representatives” of cosets '(x)y in I, 
i.e. over y; such that each coset I'(x)y contains a unique 7;. Quotient sets were 
invented more than sixty years ago to get rid of this type of calculations. 
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As C(x)~1 = C(a7"), uv = w implies that 
v=u wed (x ")wnCly), 


where this is a left [’-invariant set dependent only on I°w. Conversely, every element 
v of this set is in C(y) and satisfies uv = w for some u € C(x). However, knowing 


v determines Pu = wv '. Replacing v by yu replaces u = wv! by uy! = 
wo 'y7! € wu! TL, and conversely. So, for given I'v, the cosets Iu involved in the 


calculation of c(x,y;w) are those having non-trivial intersection with wv !T, ice. 
those corresponding to elements of Pwv !I’. There are 


Card (\Pwv'P) = Card [[\Pw(Pv)7*] 
of them. Hence 
(24.8) C(x, y;w) = ye Card [P\Pw(Pv)~*] ; 
DvCC(x2)wnC(y) 


where summation is over cosets I"v. 
To deduce (7’), we write 


c(xz,y; wy) = ‘Dy Card [P\Pwy(Pv)~*] = 
DTvCC(x2)wyNnC(y) 


= by Card [r\rw (Fey) ] = 


TvcC(C(x)wnC(y)}y 
= se Card [P\rw (Poy) *] . 
Tvy7!CC(a«)wnd(y) 


The sum obtained can be deduced from (8) by the permutation Pu > Tvy~', 


whence (7’). 
This being settled, let us calculate the contribution of a double coset 


lull = U I'wy 


to the sum 
T(x)T(y)e(g) = >> ela, y; w) (wg) 


extended to all cosets "'w € I’, but with finitely many non-trivial coefficients. It is 
clearly 


e(z,y;w) SY) v(wyg) =c(2,y;w)T (w*) oly). 
rw=t)\r 


Summing over all [wI” now gives the final formula 


(24.9) T(x)T(y) = ys c(x, y; w)T (w*) : 
r\r'/r 


This finishes an “elementary” proof in which, believe the author, it is easy to get 
lost in dead-ends. 
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(ii) Operators T(x) for a locally compact group. We now assume G to be locally 
compact unimodular and I to be discrete. 


Lemma 3. For all xz € I’, the operator T(x) maps L(I'\G) to L(I\G). 


For y € L(I'\G), let M be a compact set such that y is zero outside 1M. 
T(x)o(g) £0 only if «~'I'g#I'M, hence only if g € PaPM = Pala '.xM. By 
(3’), ePx~+ is a finite union of cosets '(x)hi. Thus Pala 7t.aM = UThieM = 
I'M’, where M’ = (JhixM is compact, qed. 


Theorem 36. For all x € I” and all p > 1, the operator T(x) : L(I\G) — 
L(I'\G) has a continuous extension to L?(I\G) and 


(24.10) (T(x) lh) = (g|T (x~") ¥) 
for all yp € L?([\G) and p € L4(I\G). In particular, 
(24.10) ay afar") 

in L?(I\G). 


Let us start by remarking that the automorphism g > «2 ‘gx preserves the 
measure dg and that its inverse transforms I into «fa *. Hence for all p > 1 we get 
an isomorphism from L?(I’\G) onto L?(aP'x~'\G) by transforming every function 
v(g) of the former space to a function y(x~'gz) of the latter one.** It follows that 


[leo a= [ |p (x gx) |" dg = 


2DP2-1\G 
(24.11) =[ |p (x *g)|” dg 
«0a-1\G 


since quotient measures are right invariant. The same argument implies that 


(24.11’) ie p(g)dg = = ef p(xg)dg 


for all y € L'(I'\G). 


°4 The attractive formula 


| y (2729) ¥(dg) = i ola)o(ea)dg 
I'\G 


r\G 


where y and wy are left [’-invariant and where x ¢ I’, is not well-defined since the 
function g ++ y(a~'g) is not I-invariant. But formula (11’) is correct. More 
generally: let G be a lcg, H a closed subgroup of G and s an automorphism of 
G preserving the measure dg. Then, for any left H-invariant function y, 


_ atgag = | (sg)dg . 
H\G s-1(H)\G 


This relation is connected to integration formulas for images of measures 
(Chap. XI, n° 13, theorem 23) and, if H is discrete, reduces to them by re- 
placing integration over H\G by integration over a “fundamental domain” of H 
in G. See the change of variable formula for the simplest Riemann integrals over 
R. 
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Then let y and w be two continuous functions with compact support mod I’. 
(11’) applied to T(x) y(g).w(g) gives 


/ T(x)o(9).6(g)dg = / T(2)o(e9).o(eg)dg = 
I\G 


x—!Pa\G 
= dg y («~*yag) (29) 
—_— oo: 
car o(yg) = ¥(9) 
= i dg SY. plyg)v(x79), 
x-!D2\G 


D(a-1)\a-1Pa 


which is seen by transforming the sum by y +> 2~'ya. Thus 


Ts _gPe@eto)-w(a)dg = olg)b(eg)dg = 


(o-4)\a 

g(g)dg SY) v(xryg) = 
P(«-1)\r 

=| oa) (e) vag. 
r\G 


Hence, replacing w~ with its conjugate, 
(24.12) (T(x)elv) = (yIT (e™*) 4) , 


where this is the inner product on L?(I"\G) or, more generally, the duality between 
L? and L‘. Therefore, to finish the proof, the continuity of T(a) on L?(I’\G) 
remains to be shown. 

To do this, we start from the relation (same proof) 


i L(e)o(9)-0(o)dg = | T(#)9(o)-0(a)do, 
I'(x)\G I'\G 


where y and w are continuous with compact support modI’. It shows that 
|(T(x2)p|)| is bounded above by the product of the norms of L(x)y on L? (I’(x)\G) 
and of w on L7(I'(#)\G). But by (11), 


/ IW)? dg = (P= F(a) lle, 
I'(x)\G 


where this is the norm on L‘(I‘\G). Hence (Hélder) 


1/p 
\(T(a)elb)| < Ws P(w)]"/4 la (/ joe] , 


I'(x)\G 


where the norm of yw is calculated on L4(£’\G). Applying g > xgx~' to the latter 
integral transforms it to 


§ 8. Hecke Theory 455 


So finally, 


Syya ay 
(T(z)elv)| <P P@)"* (Pe P(e)? [elle la 
As this inequality holds for all y, w € L(I’\G), it follows that the operator T(x) 


extends to L?(I’\G) and satisfies 


(24.13) IT(2)|| < (0: P@))!4 [PP (@)]”” 


on L?(I'\G), which gives the theorem proved for holomorphic modular forms in 
1939 by its inventor, Hans Petersson by repeatedly using fundamental domains.?° 


(iii) Operators T(x) for the modular group. If G = SL2(R) and if I is the 
modular group, as will be seen, x can be chosen to be any matrix proportional to 
a rational matrix or equivalently, an integer matrix with determinant > 0. Such a 
matrix can be written as 


(24.14) z= ni ( ys a,b,c,d€Z, ad—be=n>0 


and only these matrices will be used in what follows. Replacing G = SL2(R) with 
GL2(R) would avoid square roots, disliked by arithmeticians; it is the point of view 
taken by Jacquet and Langlands, but here I will keep to SL2(R). 

As the operators T(x) only depend on cosets aI’, we start by determining 
them. The answer is provided by the elementary divisor theorem:*° if L is a lattice 
in Z”, there is a basis (a;) of Z” and integers d; > 0 satisfying 


(24.15) dy|d2|...|dn , 


fully determined by LZ and such that the vectors b; = dia; form a basis for the lattice 
L. This being so, if g € M,(Z) is a matrix with determinant > 0 and if this result 
is applied to L = g(Z"), one can deduce that there are matrices y,y' € SL,(Z) 
such that 


ygy' = diag (di,...,dn) . 


Let us come back to SL2. Matrix (14) can, therefore, be written as 
iy & 2) ~' with 0 < dildo, 


up to the factor n-2, where y,y’ € [= SL2(Z). So it is possible to only consider 
diagonal «. The matrix «1 is then proportional to wzw +. Thus 


(24.16) Forata Lf Te Nate. Ea yale) 


for all x € I’. (13) then shows that |/T'(x)|| < [0 : '(2)]. 
This can also be proved without knowing anything. To this end, let us consider 
the involution o of G given by 


°5 The same could be done here provided G is known to contain a measurable set 
F meeting each coset I'g at precisely one point. True, but you will not be able 
to prove this in the general case without using the methods of Chap. XI, n° 12. 

°6 See for example exercises in § 31 of my Cours d’algébre, or, of course, Serge Lang, 
Algebra. 
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ab\ fd b\ 1-4 _f1 0 
(24.17) aG ‘) = (: 3) =wg w where w = ( ae 


We have o(I’) = I’, whence [aI = o(Io(x)L), ie. 


Tel =w Pwr bwolwaw ww wow =Te lr. 


Theorem 37 (Hecke-Petersson). The operators T(x) commute and are Hermi- 
tian on L?(I'\G). 

The latter point is obvious: T(x)* = T(x~') (theorem 36) for all 2 € I’, and 
T(x!) =T(a). 

To prove the former, we start from (9) 


T(a)T(y) = >) ela, y32)T (27) = ez, 3 2)T (2). 


The coefficients being real, it follows that 
on L?(I’\G), and so 


qed.9” 


(iv) Operators T(p): the case of functions on I'\G. As the substance of the the- 
ory is obtained by supposing n to be prime in (14), for the moment this restriction 
will apply. Still without knowing anything, let us first show that [al = IhpI, 
where generally speaking, we set 


“is 
(24.18) ee LL cea ea 
0 |m|~? 


If x is as in (14) with n = p, then ad — bc = p and the gcd of c and d is either 1 
or p. If (c,d) = 1, a’d — b'c = 1 can be solved; so there is some u € Z such that 
a= pa’ +cu and b= pb’ + du; then 


_1 fab -ifluw 0\ fa’ v' 
ie oa & i) =P : (( 7 G ; € 4) = 2whor! 


where 7’ € I’, and so aI’ = T'h,I since x(u) € I. If (c,d) = p, then (a,b) = 1, 
the matrix wz falls in the previous case and as w € I’, once again al = [wal = 
Th,I, qed. 

We will set T(p) = T(hp) = T(h>'), whence 
(24.19) T(p)e(9)= S_> ol(hp 79) 


D(hp)y 


°? Some readers will probably regret that this proof of a purely algebraic result 
uses calculus from theorem 36. But a substitute for this theorem could be use 
by considering G as an “abstract” or discrete group and by replacing L(I’\G) 
with the space of [’-invariant functions having finite support mod I’. Integrals 
are then replaced by finite sums. 
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where summation is over ['(hp)\I° and where I'(h,) is the set of y € I such that 


hp yhp ET. 
al 1 
p2z 0 a b\ [ p2 0 \_ fa b/p 
0 p2 cd 0 pz} \wc dad)’ 


shows that I'(h,) is the congruence subgroup b = 0 mod p of I’, which corresponds 
to a Borel subgroup in SL2(Z/pZ). The Bruhat decomposition being valid over 
any field, cosets I'(h,) are represented by matrices 1 and wy(m) = «(m)w where 
m varies mod p, or by matrices w and x(m) since y > yw permutes these cosets. 
There are p+ 1 of them and 


T(p)e(9) = ¢ (hp wg) + 9_y [hp x(m)g] . 
As 9 is w-invariant, we finally find 


(24.20) T(p) (9) =e (hog) + D> [hp a(m)g] . 


m mod p 


NIR 


Let us now analyze the effect of the operators T(p) on the Fourier series 


(24.21) 9 [2(u)g] = }0 an(g)e(nu) 
of y~, whence 

an [x(u)g] = an(g)e(nu) . 
By (20), 


T(p)¢ [x(u)g] = ¢ [hpa(u)g] + Soy [hp *a(ut+ m)g] = 
= ¢[x(pu)hpg] + D> 9 E (=) hy*4] = 
= ge su (hpg) e(pnu) ye Doan pg) e(nu/p) > e(m/p)” 


m mod p 


On the right hand side, the sum over m equals 0 if p does not divide n and p if it 
does. Hence 


T(p) =. ae hpg) e(pnu )+ 55 papn (A Pp > 9) e(nu). 


The coefficients of the expected expansion 


(24.22) T(p)¢ [x(u)g] = 5 > ap:n(g)e(nu) 
are therefore functions 
Jey, ‘ 
papn (Rpg) + 4n/p(hpg) — if pin 
(24.23) Apin(g) = - 
papn (hp 9g) if (p,n) =1. 


In particular, 


ap,0(g) = pao (h;‘g) + ao (hpg) - 
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So equality ao(g) = 0 is preserved. 
Since hmhn = hmn, relations (23) can also be written 


(lige i (nz) ) if pin, 
(24.23’) ea. (hon 9) + On/p (hajpg) — if pln 
Papn (Rpn 9) if (p,n) =1. 


(v) Eigenfunctions of Hecke operators. Eigenfunctions of operators Tp) have 
curious properties. Let us suppose that a function 


(9) = a0(g) + > an(g) 


satisfies 


for all p and set 
(24.24) Wnrl(g) = nan (hg) for n 40, 
whence W,,(gu) = Wn(g)e(nu) and 
(9) = a0(9) + 5 n7* Wr (Ang) - 
nA~0 


Relations (23’) multiplied by n become 


Won(g) + PWr/p(9) 
(24.25) Xp) Wag) = 

Won(g) 
according to whether p divides n or not. In particular, 


(24.25’) Wep(9) = A(p)W21(9) 


for p prime, as well as 


(24.25”) (p)ao(g) = pao (hp 'g) +40 (hpg) - 


Lemma 4. Every function A(p) defined on the set of prime numbers extends to a 
unique function X(n) defined for n > 1 and such that (1) = 1, 


(24.26) A(p)A(n) = A(pn) if (p,m) =1, 
(24.26’) A(p)A(n) = A(pn) + prA(n/p) if pin. 
Moreover, (mn) = A(m)X(n) if (m,n) = 1. 
The proof is obvious. Since A(1) = 1, (26) holds for n = 1 for all p. Because of 


r (p**") = X(p)A (v*) — pr (p**) 


it is possible to calculate the numbers \(p*), hence the numbers \(n) = T] \(p") 
starting from the prime factor decomposition of n. The function obtained obviously 
satisfies the conditions of the lemma. 
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Similar arguments and relations (25) and (25’) show that, for all g, Wi(g) = 0 
implies W,,(g) = 0 for all n > 0. If, on the other hand, Wi(g) 4 0, the lemma 
applies to expressions W;,(g)/W1(g). Hence 


(24.27) Wen(g) = A(\n|)Wai(g) for all n > 0 
or, coming back to the an(g) and to (24), 

A(n)ai (hn for n>0, 
a ee ites im for n<0, 
as well as 
(24.28") X(p)a0(g) = ao (pg) + pao (519) - 


The Fourier series of vy is, therefore, of the form 


(24.29) g(g) = D> A(In|n“* Wea (ng) + a0(g) + D> A(n)n- Wa (ng) - 


n<O n>0 


Lemma 5. Relations (26) and (26’) are equivalent to the identity 


(24.30) S~A(n)X” =] [1- A@)X + pX?]* 


n>0 


between formal series. 


Setting 
k\ yk 
F(X) = > (p*) x*, 
it is clear that the series F(X) = > \(n)X” satisfies 
F(X) =|] F(X). 


But (26’) for n = p*** implies that 
F,(X) =1+A(p)X + JA (v‘*?) Xht? 


=14+X(p)X > [Aw)a Go) Sa (v')] ete 


= 1+X(p)X + A(p)X [Fp(X) — 1] — pX?Fp(X) = 


= 1+A(p)X Fp(X) — pX? F(X) 
(24.31) [1 — A(p)X + pX?] F(X) = 1, 


whence (30). The converse is obvious. 

Identity (30) between formal series can be transformed into an identity between 
convergent series if (A(n)) does not increase too rapidly. It suffices to replace X” 
with any other function a(n) satisfying a(1) = 1, a(mn) = a(m)a(n) if (m,n) = 1 
and a(p*) = a(p)*. Fy(X) is then replaced with 
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-1 


SoA (v*) a(p)" = [1 — A(p)a(p) + pa(p)”] 


Computing the product gives 


(24.32) S> A(n)a(n) =] [1 - A@)a(p) + pa(p)”] 


n>0 


-1 


For example, we may choose a(n) = x(n)n-*, where y is a character modulo an 
arbitrary integer m > 0. Then 


(24.38) Lnso Mn)x(n)/n* =T] [1 — A(P)x(p)p~* + x (p”) pt?) 


This supposes A(n) = O(n”) for sufficiently large co. 
Multiplying (33) by W+1(g), we get two relations 


Yo Wan(g)x(n)n~* = 


n>0 


21(9) T] [1 —A@)x(p)p? + x (p?) p] 


(24.33’) 


] 
= 


(vi) Applications to modular forms. All of the above only assumes that y(g) is 
an eigenfunction of the operators T'(p). Calculations now need to be made more 
specific in order to get results about modular forms. 

First of all, the operators T(2) commute with right translations and preserve 
the subspace ¥,(I"\G) of functions of weight r. Hence, if 


(9) = J(9;t)"f(9t) = fr(g) 
is a solution of f(yz) = (cz +d)" f(z), 


(24.34) T(p)fr(g) = F(g3 4) "Tr(p) f (2) 


where the function T;(p)f has the same property. It is given by 


T,(p) f(z) = J(95 4)" fr (tpg) + S— J(g30)" fr [hp '@(m)g] = 
= J(g; pees a)" f (hpgt) + 
+J(9;4)" $> J [hp *a(m)g; i] f [hp *a(m)gi] = 
= J (hips z)” f(z) ieee (hp';2+m) " f((z+m)/pl, 
whence 


(24.35) Tr(p) f(z) =p"? f(pz) +p? So f(z +m)/p] . 


m mod p 


The standard Hecke operator 


(24.35’) Tpf(z)=v" f(z) +p > fl(e+m)/r], 


r/2-1 


which it would be more appropriate to denote by T;.,, is the product of p and 


the previous operator. It applies to all solutions of the functional equation 
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(*) f(q2z) = (cz +d)" fz), 


whether holomorphic or not. To avoid confusion, for functions on G, I will keep to 
definition (20), which does not assume the weight of y to be r, and for functions on 
P to (35), even if this means reformulating results in Hecke’s notation to obtained 
somewhat simpler formulas, for example by using the fact that 


(24.36) -T,(p) f = A(p) f —+ Tof =Apf where Ap = p"’?"X(p). 


Because of (35) the operators T;.(p) clearly preserve the set of entire (resp. 
parabolic) modular forms of weight r. So they act on the space H?(I'\G) Cc 
L?(I\G) of parabolic forms of weight r. The operators T(p) are Hermitian and 
commute pairwise (theorem 37) on this finite dimensional space. As a result, the 
eigenfunctions of the operators T(p) generate the vector space H?(I°\G), a result 
due to Petersson. 

If f(z) = SSan(y)e(nz) is the Fourier series of a solution of (*), comparing 
with (21) shows that, for z = gi, 


(24.37) an(y)e(ne) = J(g;4)" an(g) = J(g38)"m Wa (hing) - 


Replacing g with hz‘g replaces J(g;i) with J(hn‘g;i) = In|2 J(g; 2) and z with 
h,*z = z/|n|. Hence 


(24.38) Wr(g) = n|n|-"/?I(g54) "an (y/|n|) e(2) , 
(24.39) Wr(ug) = Wn(g)e(£u) 


for all u € U, the sign that should be chosen being that of n. 
In the case of an entire modular form f(z) = ane(nz), 


An(y) = an exp (—20ny) 
with non-zero a, for n < 0. Thus 
(24.40) Wrlg) =n)" an J(g; i)” exp (—2ry) e(x) 


1-r/2 


for n > 0, a result proportional to n an = al,. Setting 


(24.41) Tr(p) f(z) = d0 apine(nz) , 

according to the case, relations (25) transform into 

(24.42) p)a,, = Onis + Pan /p or Ayn 

if T(p)f = A(p)f for all p. Lemma 4 then shows that 

(24.43) a, =X(n)a,, ie. Gn =Anai for n>0 

by (36). Incidentally, observe that if the form f(z) is parabolic, it is determined 


by Ap, up to a constant factor. 
For n = 0, (42) shows that 


(24.44) ap FO => Ap =14+p"” * =or-1(p). 


This expression is the Fourier coefficient a, of the Eisenstein series E,,(z) of (17.25), 
up to the factor —2r/b”. To deduce 
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(24.45) Tp Er(z) = ApE,(z), 


it suffices to check (42) for all a}, corresponding to an = 0r—1(n), which is easy. As 
Maaf series also satisfy (46), with other eigenvalues, this proof of (46) is obviously 
not correct... 

To conclude this n°, let us summarize the main results obtained: 


Theorem 38. For every even integer r > 2, the space of parabolic forms of weight 
r for SL2(Z) has a basis whose elements satisfy the following properties: 


(i) Tp f =Apf for all prime p; 
(it) f(z) = DU Ane(nz) ; 


(wt) for allm > 1 and all characters x modm, 


(24.46) S>Anx(n)n~* =] [1 Avx(p)p* + x(p)?p 9] 5 


(iv) the function f is fully determined by the eigenvalues Ap. 


These results complete Weil’s theorem [n° 23, (ii)]. As ||Z'(p)|| = ||T'(hp)|| < [LF : 
I'(hp)] = p+1, |Ap| < p?~1(p + 1), a “trivial” result proving the convergence of 
both sides of (46) for Re(s) > r/2+1. 

For r = 12, the only parabolic form is the function 


A(z) = S > r(n)e(nz) : 


This is obviously an eigenfunction of the operators Ti2(p), whence the relation 


S>r(n)x(n)n-* = J] [1—r(p)x()p* + x(p)?p 8] = 


conjectured by Ramanujan in 1916 and proved by Mordell in 1917, both of whom 
only considered the trivial character. 

There are also Hecke operators associated to congruence groups. The Jacquet- 
Langlands method reduces them to convolution products on the adelic group SL2 
or GL2, but it does not avoid explicit computations to get precise results about 
the associated L-series. See Koblitz’s book, already referred to, which keeps to the 
most classical methods. 
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25 — Lie Groups 


(i) Definition and examples. A Lie group G is both a C® manifold and a group, the 
map (2, y) > ry! from G x G to G having to be C®. Then so are (x,y) +> xy, 
xr >a', etc. Maps «+> az or xa or x are diffeomorphisms from G onto itself. 
If G and H are to Lie groups, a homomorphism from G to H is a C® map f such 
that f(xy) = f(x) fly). 

Additive groups R”, multiplicative groups R* and C*, n x n triangular matrix 
groups with non-trivial diagonal entries, or else > 0, or else equal to 1, groups 
GL, (R), open in M,(R), groups GLn(C), open in M;,(C), are Lie groups. 

A subgroup H of a Lie group G that is a submanifold of G is a Lie group; H 
will be said to be a Lie subgroup of G, Indeed the map (a, y) > cy + from Gx G 
to G maps the submanifold H x H to H. It is therefore C® as a map from H x H 
to H (Chap. IX, n° 13, exercise 3). A Lie subgroup is closed (and conversely, but 
it is far less obvious). Its closure H’ in G is indeed a subgroup in which H is open. 
This is a general property of submanifolds [Chap. IX, n° 13, end of section (i)]. 
However, H’ is the union of cosets «H, which, like H, are open in H’ and pairwise 
disjoint, hence closed in H’. In particular, H is closed in H’, hence in G, qed. 

The next general result leads to many examples of Lie groups. Let us suppose 
that a Lie group G acts on a manifold X in such a way that the map (g,x) +> gx 
is C°. Then the map j : g +~+ ga has constant rank for all x € X, ie. is a 
subimmersion [Chap. IX, n° 12, (iii), end]. Indeed for all a € G, j(ag) = aj(g). The 
tangent linear map to j at ag is therefore the composition of the tangent map to 7 
at g and of the tangent map to «+> az at j(g). Now, the latter is an isomorphism 
like x +> ax, qed. 

This being settled, it follows that, for all  € X, the subgroup Gz of g such 
that gx = x is a submanifold of G [Chap. IX, n° 13, (ii)], hence a Lie group. 

For example, the subgroup of G defined by gs = sg is a Lie group for all s € G: 
make G act on G by (g,2) ++ gag”. If G = GLn(R), G can also be made to 
act on the right by (g,s) ++ g’sg, where g’ is the transpose of g. The group of 
automorphisms of a non-degenerate bilinear form, for example orthogonal groups 
in the symmetric case or symplectic groups in the alternate one, is therefore a Lie 
group. Making G act on R* by (g,x) +> det(g)a shows that SL,,(R) is a Lie 
subgroup of G. 

Every C® homomorphism f : H +~> G from a Lie group to another is a 
subimmersion: make H act on G by f (h,g) +> f(h)g. The kernel of f is therefore 
a Lie subgroup of H. On the other hand, the image of f is not necessarily a Lie 
subgroup of G. 

Indeed, apart form subgroups that are genuine closed submanifolds, a Lie group 
G can contain subgroups H which, though not submanifolds, are equipped with the 
structure of a Lie group such that the identity map H —> G is an immersion. Lie 
group theory cannot be constructed without giving as much importance to these 
“bad” subgroups as to the “good” ones. 

In particular, one-parameter subgroups whose elements are homomorphisms 
t ++ ¥(t) from R to G, play a fundamental role. Such a homomorphism only 
need to be continuous to be C°° — obvious if G = GL,,(R). It is then an immersion, 
unless 7(t) = e for all t, since the map has rank 1 everywhere or rank 0 everywhere. 
In the general case, the kernel of ¥ is either {0}, or a discrete subgroup wZ of R. In 
the latter case, the image y(R) is a compact subgroup, and 7(R) is a 1-dimensional 
Lie subgroup isomorphic to R/wR. But in the former case, 7(R) may be everywhere 
dense [Chap. IX, n° 13, (iii)]. 
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We show that a continuous homomorphism from a Lie group to another is C™. 
This is particularly easy to prove for finite-dimensional continuous linear represen- 
tations (H, 7) of a Lie group G, which are homomorphisms to some GLn. For any 
function y € D(G), i.e. C°° and with compact support, the function 


x(a)m(y) = x(2) i, x(y)oly)dy = if x(ev)oly)dy = / a(y)y (27*y) dy 


is indeed C®. This follows from the simplest theorems on differentiation under 
the f sign. On the other hand, there exist y whose support is an arbitrarily small 
neighbourhood of e, hence Dirac sequences composed of functions yn, € D(G). So 
the unit operator is a limit of operators 7(y). But in finite dimension, an everywhere 
dense vector subspace is the whole of the space. Hence there exist y such that 
m(y) = 1, qed. 

A final remark: every Lie group can be equipped with an analytic structure 
compatible with its C® structure, i.e. for which change of charts functions and 
the composition law (x,y) > xy’ have real power series expansions. Such an 
analytic structure is unique. 


(ii) Operations on tangent vectors. Let G be a Lie group. Since, for all a € G, the 
map g+—> ag is a diffeomorphism, the tangent map G’(g) —> G’(ag) is bijective. 
It is then natural to denote by ah the image of h € G’(g) under this application. 
ha can be similarly defined. We check that a(hb) = (ah)b by differentiating the 
identity a(ab) = (ax)b. Hence we can talk of ahb without ambiguity. If h is the 
tangent vector at t = 0 to some path 7(t) on G, then 


(25.1) df (agbaRsyS < flaOu fort 20: 


Let us now set m(x,y) = xy and calculate 
m' (a,b) : G’(a) x G'(b) —> G'(ab). 


To do this, we need to differentiate (x,y) > f(xy), hence®® add the differentials 
of +> f(xb) and y+-> f(ay). As a result, 


(25.2) m'(a,b) : (h,k) + hb +ak. 


In Leibniz style: d(xy) = dx.y + x.dy. In particular, m’(e,e) is addition (h,k) -3 
h+k in G’(e) even if G is not commutative. If h and k are given by h = 9‘(0), 
k = 6'(0) where y and 6 are curves over G, vector (2) is tangent to t + ¥(t)6(t) 
at t = 0. Generally speaking, setting 


°8 We recall the calculus rule of Chap. IX, n° 12, (iv): let f(z, y) be a map from a 
product manifold X x Y to a manifold Z. Denote by dx f(a, b) the differential 
of «+> f(x,b) at a and by dy f(a, b) the differential of y+ f(a, y) at b. Then 
the “total” differential of f is given by 


df ((a, b); (h, k)) = dx f ((a, b); h) + dy f ((a, 6); k) 5 


For obvious reasons, I will call this result the differentiation formula of a product. 
Together with the chain rule, it shows that given maps 7 : T —> X and 6: 
T —> Y from a manifold T to X and Y and setting F(t) = f[7(t), 6(t)], the 
tangent linear map F’(to) at a point to is the sum of the tangent maps to t +> 
F[y(t),6(to)] and t+ Fly(to), 6(@)]. 


§9. SL2(R) as a Lie Group 465 


Dof(t) = f'(0) = lim [f(t) — f()] /t 


for any differentiable function at t = 0, which contradicts taboos related to free 
variable, we therefore get 


(25.3) df (ab; hb + ak) = Dof [y(t)6(6)] 

for all f € C’(G). Setting F(x, y) = f(ay), formula (3) becomes 

(25.4) DoF [(t), 5(¢)] = DoF (x(t), 0] + DoF a, 6(¢)] 

as a consequence of the differentiation rule of a product. More generally, 


(25.5) <y(t)6(t) = 7 (t)5(t) + y(t)0"(#) 


for all t, both sides being in the tangent space at 7(t)d6(t). In particular, if 6(¢) = 
y(t), then 


7 (t)5(t) + 7(t)5"(t) = 0, 


whence 

(25.6) Lyi? = yt) Oe). 
Even more particularly, 

(25.6’) Do [y(t)"*] = —Do ly] if 7(0) =e. 


This result says that the tangent map to r+ a1 at e is X H> —X. 

Since g ++ ga induces an isomorphism from G’(e) onto G’(a), the manifold 
T(G) of tangent vectors to G can be identified to the product G’(e) x G by associ- 
ating the vector Xa € G’(a) to the ordered pair (X,a). The law of composition m’ 
of T(G) being associative, 


XaYb = X.aYa'.ab = X.ad(a)Y.ab 
where the image vector 
(25.7) ad(a)Y =aYa"', 


under the tangent map to « + ara! at e, is still in G’(e). The map g +> 
ad(g)X is a linear representation of G on G’(e), the adjoint representation of G. 
If G is a linear group and if the elements X € G’(e) are identified to matrices or 
endomorphisms, then obviously 


ad(g)X =gXg"". 


(iii) Differentiation and invariant vector fields. Translations on G enable us to 
associate two C™ vector fields, namely 


(25.8) Lx :gro-Xg, Rx:grogX, 
to every X € G’(e). They satisfy relations 


L(ga) = L(g)a, R(ag) = aR(g) 
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and these clearly characterize them (put g = e). If (X;) is a basis for G’(e), the 
vectors gX; (or Xig) form a basis for G’(g) for all g, so that any vector field on G 
can be written as a linear combination with variable coefficients of gX; or Xig. 

However, we saw in Chap. IX, n° 14 that to every vector field L on a manifold, 
one can associate a differential operator given by 


Lf (x) = df(x; L(x) 


for all x € G and all function f differentiable everywhere with values in C (or even 
in a Banach space). We will denote by L(X) and R(X) the operators associated to 
the vector fields Lx and Rx. Hence 


(25.9) R(X) f(g) = af (93 9X) = Dof [gy] . 
(25.9”) L(X) f(g) = af(9; -Xg) = Dof [y(t)*9] 


if X = ¥'(0). Clearly R(X) (resp. L(X)) commutes will left (resp. right) trans- 
lations. Operators R(X) commute with operators L(Y), for if X = y'(0) and 
Y = 6'(0), we get R(X)L(Y)f(g) by differentiating L(Y) f[gy(s)] at s = 0, hence 
by differentiating f[d(t)~'gy(s)] with respect to s and t at s = t = 0. The order of 
differentiation is irrelevant. Denoting by R(x) and L(x) the translations operators 
given by®? 


R(x) f(9) = f(gz), L(a)f(g)=f (eg) for rec, 
it is at least formally possible to write that 
(25.10) R(X) = DoR(t)], L(Y) = DoL [6(t)] - 


These vector fields are related to one-parameter subgroups of G. Indeed, since 
Rx(ag) = aRx(qg), left translations permute the maximal integral curves of the 
vector field Rx (Chap. IX, n° 15). If 7(¢) is the maximal integral such that y(0) = e, 
7'(0) = X, then by definition y’(t) = y(t)X. However, if I is the definition interval 
of y(t), for all s € J and all vector fields, the curve t > y(s +t) is the maximal 
integral with initial point y(s). The curve t > 7(s)~'y(s +t) = d(t) is therefore 
also a maximal integral of Rx, such that 6(0) = e, 6’(0) = y(s)~'y'(s) = X and 
defined in J — s. As a result, J = I — s. Thus J = R and y(s + t) = ¥(s)y(t) for 
all s, t. Hence the integral curve >(t) is a one-parameter subgroup, which will be 
denoted by yx (t). 

Conversely, for any one-parameter subgroup, 7(t) is obtained in this manner: 
by differentiating the equality y(s+t) = = ¥(s)y(t) at s = 0, we find y(t) = Xy(t) = 

a(t)X = Rx [y(t)] where X = +'(0). 
If G is a Lie subgroup of GL,(R) and X is identified to a matrix, then 


x(t) = exp(tX) = }o(ex)™ 
[Chap. IX, n° 15, (v)], which, in the general case, explains the conventional notation 
(25.11) yx (t) = expc (tx). 
As a homomorphism 7 : G —> H of Lie groups transforms one-parameter sub- 


groups of G into their analogues in H, clearly, 70 expg = expy 07’ (e). For G = H 
and n(x) = grg”t, 


°° Attention should be paid to the definition of operators R(ax) adopted here: it is 
different from that of Chap. XI, n° 31. 
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(25.12) g.exp(X)g7* = exp [ad(g) X] 


follows. So g such that ad(g) = e are the elements of G commuting with exp(X) 
for all X, hence with the subgroup they generate. We will see later that it is in G 
if G is connected. In this case, the kernel of the adjoint representation is, therefore, 
the centre of G. 

If (H,7) is a finite dimensional continuous linear representation of G, hence 
C@, then, setting 


(25.13) Tl (Xx) = nm (e)X = Dor ly(t)] 
for all X € G'(e), 
tt [expg(X)] = exp [7(X)] 


where the right hand side is the usual exponential of a matrix. One readily deduces 
that, if G is connected, then an operator T :H —> H commutes with all 7(g) if 
and only if it commutes with all 7(X). Likewise, subspaces of H invariant under 
the 7(g) or the 7(X) can be shown to be the same. 

In fact, invoking the exponential map to get these results is unnecessary. In- 
deed if G is made to act on the manifold of endomorphisms of H. by (g,7) +> 
m(g)I'n(g)~', by section (i) above, for all T, we get a map of constant rank. To 
show that it is constant in the neighbourhood of e (hence in G if G is connected), 
it is sufficient to show that its rank at the origin is zero. But if y'(0) = X, then 


(25.14) Do [x (v(t) Px (y(t) *] = a(X)P - Tr(X) 
by (6’) and the differential rule of a product, whence the result. 


(iv) Canonical coordinates. As the vector field Rx depends in a C'° manner 
on the parameter X, the map (X,t) +> expg(tX) is C® (Chap. IX, n° 15), hence 
so is X ++ expg(X). On a neighbourhood of 0, but not globally in general, it is a 
diffeomorphism because the tangent map to expg at 0 is X +> Do expg (tx) = X. 
The subgroup generated by the image of expg, i.e. the set of products 


exp (X1) ...€Xp (Xn) ’ 


is therefore open in G; it is G if G is connected. 

For any sufficiently small neighbourhood V of e in G, there is a unique diffeo- 
morphism log, from V onto a neighbourhood W of 0 in G’(e) such that exp, o log, 
is the identity on V, whence a chart (V,log,), called canonical,’ from G to V. 
For all g € G, the map Y +> g.exp(Y) is, therefore, a chart of G in the open set 
gV. It follows that a function f is of class C? on G if and only if, for all g, the map 
Y +> flg.exp(Y)] is of class C? on W, i.e. has continuous partial derivatives of 
order < p. For example, the first order derivatives are the functions 


(25.15) Y ++ Dof [9.exp(¥ + tX)] 


where X varies in a basis of G’(e). For Y = 0, we recover the functions 


R(X) f(g) = Dof [g- exp(tX)] 


defined above by supposing that f is differentiable, but due to the previous for- 
mula, they can be defined whenever they are well-defined. If f is of class C?, these 
functions exist and are of class C??. 


100 The notation exp and log will be used when there is no possibility of confusion. 
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The converse is obvious if f is differentiable at every point of G, since then 
R(X) f(g) = df(g; gX). However, in every local chart, differentiation operators with 
respect to these coordinates are linear combinations with C'™ coefficients of the 
operators R(X) associated to the vectors X in a basis of G’(e). But it is not a 
priori obvious that the existence and the continuity of functions R(X) f imply the 
differentiability of f, because the derivatives R(X) f(g) are computed in a chart 
depending on g contrary to derivatives (15). Hence a proof is needed. 


Theorem 39. Let G be a Lie group. A function f defined on G is of class C? if 
and only if, for all X € G'(e), the derivative 


(25.16) R(X) f(g) = Dof [9-exp(tX)] 
exists and is a function of class C?~' on G. 


Let us first show that the map X +> R(X)f is linear. Indeed, suppose that, for 
given X, the derivative R(X)f exists and is a continuous function of g. Replacing 
g by g.exp(tX), the function t > f[g. exp(tX )] is seen to be differentiable and its 
derivative 


Df [g. exp(tX)] = R(X)f [g-exp(tX)] , ot D=d/dt, 


to be a continuous function of (g,t). Hence, if M is a compact subset of G, the 
right hand side is bounded for g € M and |¢t| < 1. Integrating with respect to t then 
gives an inequality of the form 


If Ig-exp(tX)] — f(g)| < eM) It 


which holds for all g € M and |t| < 1. Then for any y € D(G), (dominated 
convergence) 


eg el [g-exp(tX)] — f(g 


t=0 t o(a)deg ~ [ROMO 29. 


But the left hand side also equals 


tim [ (gE) — 9(9) De 


As y € D(G), the quotient converges uniformly to the function —R(X)y staying 
zero outside a fixed compact set. It is therefore possible to pass to the limit under 
the f sign. Hence 


(25.17) i R(X) f(9)o(g)deg + i: f(g)-R(X)e(9)deg = 0 


for all y € D(G). The right hand side being a linear function of X, the linearity of 
R(X)f follows from the assumption of the theorem. 

To show that f € C?(G), it is convenient to use canonical charts 
kind”. They are defined using a basis (X;)i<i<n of G’(e) and a map 


“of the second 


(25.18) X= 1X1 tee t tiXx t— exp (t:.X1) ...e€xp (tnXn) 


As the tangent vector to t +> exp(tX) at t is, by definition, exp(tX)X, the differ- 
entiation formula of a product shows that the tangent map to (18) at X = 0 is the 
identity. Map (18) is thus a diffeomorphism in the neighbourhood of 0. A function 
f is then of class C? if and only if so is the function 
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F(ti,...,tn) = f [g. exp (1X1)... exp (inXn)| 


on R” for all g. 
To compute the derivative D;F' with respect to t:, we use by definition the 
expression 


f [g exp (t1.X1)... exp (:.X; + AX;)...exp (tnXn)] . 

Setting 

x =exp(tiX1)...exp(trnXn), vi = exp (ti41Xi41)... exp (tnXn) , 
it equals 

f [gx.2;* exp (hX;) zi] = f [gv.exp (hXj)] ot Xj =ad (a;)7' Xi. 
Thus 

Di F(t) = Dof [gx. exp (tX;)] = R (Xi) f(ge), 

the existence of the left hand side being equivalent to that of the right hand one. 


Hence these derivatives exist if R(X) f is well-defined for all X. However, there are 
relations of the form 


(25.19) ad(g)"?Xi = Yo aula) Xj 
with coefficients ai; € C°(G), whence 

Xj= Sai (ai) X;. 
As R(X) f(g) is a linear function of X, it follows that 


D;F(t) = ye aij (ai) R(X5) f(gx) 


where ai;(xi) are C® functions of the coordinates ty. For given g and under the 
assumption of the theorem, the right hand side is, therefore, a function of class 
C”~! of the tx, ged. 

We will return to this “obvious” theorem in the next n° in the context of C'° 
vectors in infinite dimensional representations. 


(v) The Lie algebra of a group. There are several infinitesimal methods to em- 
phasize the non-commutativity of multiplication in a Lie group G. The simplest 
consists in considering vectors X,Y € G’(e) defined by curves y(t) and 6(t). Ap- 
plying formula (9’) twice shows that 


(25.20) R(X)RYY)F(9) =< flov(s)6()] for s=t=0 
(25.20) RYYIR(X) (9) = Fs [gd(t)v(s)] for s=1=0. 


The Jacobi bracket R = [R(X), R(Y)] = R(X)R(Y) — R(Y)R(X) of the two oper- 
ators considered is, therefore, given by 


2 


(25.21) RAG) = {I lorls) (0) = F 19(0) (5) 
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at s =t =0. Now, like R(X) and R(Y), R is defined by a vector field (Chap. IX, 
n° 14). Like Rx and Ry, it is right invariant. Hence there exists Z € G’(e) such 
that R= R(Z). We set Z = [X,Y]. So by definition, 


(25.22) R([X,Y]) = [R(-X), R(Y)] . 


For g = e, the previous formula shows that, for all f € C?(G), 


(25.23) df (e;[X, Y]) = {f ly(s)6(t)] — Fl6@)r(s)] - 


This expression needs to be calculated at s = ¢ = 0. A similar calculation would 
also show that 


(25.23’) L([X, Y]) = [£(X), L(Y) . 


If t is a homomorphism from G to a Lie group H, then denoting by z‘(e) the 
tangent map to 7 at the origin, once again 


(25.24) n'(e)([X,¥]) = [x'(e)X,0'(e)Y] 


because 7 transforms the product 7(s)6(t) of two paths defining X and Y into the 
product of paths 70 7(s) and 70 6(t), which are tangent to the vectors 1’(e)X and 
m’(e)Y at the origin. If 7 us a finite-dimensional representation of G, for f = 7, the 
function to be differentiated in (23) equals 


m [y(s)] [5] — [6] ty(s)] - 
The derivative of s > x[y(s)] at the origin being by definition 7(X), 
m ([X, Y]) = [7(X), 7(¥Y)| 


follows, where these are now linear operators. 

Exercise 1. Let y(t) and 6(t) be two paths with initial point e tangent to X and 
Y. Differentiating the expression f[y(t)d(s)y(t)~'] with respect to s and t, show 
that 


(25.25) [X,Y] = Doad [y(#)Y] . 

Hence, setting ad(X)Y = [X,Y], 

(25.25) ad(X) = Doad [y(t)] 

for every path defining X, for example yx (t) = expc(tX). We have 
ad ([X,Y]) = [ad(X), ad(Y)] . 


Exercise 2. If x,y are elements of a group G, their commutator is the element 
[x,y] = xyx~'y~' which measures the deviation of yx from xy. Suppose that G C 
GL,,(R) and consider paths y and 6 with initial point e defining vectors X,Y in 
G'(e) C M,,(R). Using limited expansions of order 2 of y(t) and 6(t), show that 


y@), 6] = 1+? (XY -YX)+0() 
and that XY — YX is the tangent vector to t -> [y(t2), 5(t2 )]. 


The vector space G’(e), equipped with the Jacobi bracket [X,Y], is the Lie 
algebra of G and is usually written g. In the rest of this §, it is better to keep 


§9. SL2(R) as a Lie Group 471 


this notation for the corresponding complex vector space. The elements of g can 
be identified with differentiation operators having left and right invariant complex 
coefficients. 

The bracket [X,Y] is bilinear in X,Y, alternate and, like the corresponding 
differential operators, satisfy the Jacobi identity 


LX, [Y, Z]] + [Y, [Z, X]] + [Z, [X, Y]] = 0. 
Setting 
ad(X)Y = [X,Y], 
the previous relation means that 
ljad(X), ad(Y)] = ad([X, Y]) . 


A vector space equipped with a bilinear composition law (X,Y) +> [X,Y] sat- 
isfying the preceding formulas is a Lie algebra. A linear map a from the latter 
to the space of operators of a vector space H is a representation if m([X,Y]) = 
[(X), 7(Y)]. 


(vi) Lie algebras of classical groups. Let us show how to determine the Lie 
algebras of the groups defined in section (i) using subimmersions. If a Lie group 
acts on a manifold X, the maps g +—~ ga have been seen to be subimmersions. If 
fZ is the stabilizer of a in G, the Lie algebra of H is thus the kernel of the tangent 
map to g+— ga ate. 

For the map «+> xsx~' = y from G to G, we differentiate equality xs = yz. 
This gives dz.s = dy.x + y.dx and, for x =e, 


dy = dz.s — y.dx = dx.s — sdx. 


The kernel of dx ++ dy is thus defined by dz.s = s.dx, whence H’(e) = Ker[ad(s) — 
1]. 

For G = GL,,(R) acting on itself by x +> 2'sx = y (where 2’ is the transpose 
of x), dy = dz’.s+s.dz at x =e. So H'(e) is the set of X € G’(e) = Mn(R) such 
that X’s+sX =0. If s = 8’, this means that the matrix sX is antisymmetric, and 
if s’ = —s, that it is symmetric. This gives the Lie algebras of the orthogonal and 
symplectic groups. 

The group SLn(R) of real matrices with determinant 1 is obtained by making 
GL,,(R) act on R* by(g, x) +> det(g)a. All that is needed is to compute the tangent 
map to the determinant at g = e. If (e;) is the canonical basis of R”, then 


det(g) = Dg (e1),-.--,9 (en)] , 


where D is the n-linear alternate form equal to 1 at (e1,...,en). The differentials 
of the functions g +> g(e;) at the origin being X +—> X(e;), the differential of the 
determinant at e is the linear functional 


X +> D[X (e1),€2,..-,en] +... + D lei,...,€n—1, X (€n)] = Tr(X) 


on G’(e). The Lie algebra of the group G = SL,,(R) is therefore the set of matrices 
with zero trace. 

In all these examples of subgroups of GL, the Jacobi bracket [X, Y] is obviously 
the matrix XY —YX. 


(vii) Invariant distributions and differential operators. For every manifold G of 
class C™, it is possible to define a Schwartz space D(G) C C'™°(G) of C™ functions 
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with compact support and, for every compact set M C G, the subspace D(G; M) 
of functions vanishing outside M. These subspaces are stable under all differential 
operators’®! defined in Chap. IX, n° 14. This enables us to equip each D(G; M) 
with a locally convex topology using seminorms 


(25.26) pL(y) = sup |Ly(g)| = |lLell,. ; 


where L is such a differential operator. As L is a linear combination of products of 
operators R(X) or L(X), whichever, we may only consider seminorms 


|R (X12)... R (Xp) P(MDlloo » 


where the X; are in G’(e), or even belong to a basis of G’(e). Likewise, the semi- 
norms supge yy |Ly(g)|, where M is an arbitrary compact subset, enable us to define 
a topology on C®(G), that of compact convergence of functions and all their deriva- 
tives. 

Based on this, the notion of a distribution on G becomes obvious: it is a linear 
functional!°? 


bier wy) = J eeiau(c) = (YH) 


on D(G) whose restriction to each D(G; M) is continuous. This means that, for any 
compact set M, there are finitely many differential operators L; such that 


(25.27) lu(y)| < So [Lig], for all p € D(G; M). 


Here we will mostly use distributions with compact support, i.e. for which there is 
a compact set M such that p(y) = 0 if g = 0 in a neighbourhood'® of M. The 
smallest possible M is the support of wu. u(y) can then be defined for all p € C™(G) 
by setting u(y) = p(y’) for all y’ € D(G) equal to y in a neighbourhood of M, the 
result being a continuous linear functional in C°°(G): inequality (27), where the 
right hand side is computed in a fixed compact neighbourhood of p, then holds for 
all y € C™(G). Such a distribution is of the form’”* 


101 Linear, of arbitrary order and with C’™ coefficient will always be implied. 

102 The integral notion is far more convenient than that of Schwartz, especially in the 
context of Lie groups. The shortest presentation of distribution theory is prob- 
ably that given by Lars Hormander in the beginning of his Linear Differential 
Operators. 

103 Tf h € G’(a), y(a) = 0 does not imply dy(a;h) = 0... 

104 Let M be a neighbourhood of of the support of j1 and suppose there is an upper 
bound |u(y)| < ye;<, ||Livl|lac in D(G; M), where this the norm of uniform 
convergence on M. Consider the product space H = L(M)” equipped with the 
topology of uniform convergence, where L(M) is, as usual, the set of continuous 
functions on M. py can be considered a continuous linear functional on the sub- 
space M CH of (Liv)i<i<n where y € D(G,M). The Hahn-Banach theorem 
then proves the existence of a continuous linear functional on extending wp. 
Such a form is obviously given by 


(P1s--+1 Pn) -— Sr (v3) 


where the ju; are measures on M. Thus p(f) = >> wi(Lif) for all f € D(G; M). 
Generalization to all of D(G) is immediate. 
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(25.28) u(y) = 0 ws (Lig) 


where the pu; are finitely manymeasures with compact support and the L, are finitely 
many differential operators, which may be assumed to be invariant if G is a Lie 
group. The converse in obvious. 

Any tangent vector at t = 0 to a curve y(t) with initial point a € G is such a 
distribution, namely y +> dy(a;h). This can be written as 


(25.29) ay(ash) = | oa)dh(x) = Dovlr(t). 


However strange this notation may seen — it is only a notation —, it makes it possible 
to unify integral and infinitesimal methods in representation theory. 

Like in measure theory, the product of two distributions \ and yw is defined by 
the formula 


er ff ole.warte)auy) = fay) f ote.v)ara), 


It is always well-defined if one of the two distributions has compact support, 
because if y and A, for example, have compact support, so does the function 
yr > f v(a,y)dX(x) and it is also C°. This can be seen using a generalization 
to distributions of theorems on differentiation under the f sign. The essential step 
consists in showing that, if y(t) is a C® curve, if y € D(G,M) and if M’ isa 
compact neighbourhood of M, then the function 


glx, r(t)] — 9 lx, (0)] 
t 


converges to its limit in the topology of D(G, M’) as t tends to 0. 

The Lebesgue-Fubini formula applies for all y € C°°(G) if \ and yz have compact 
supports; and (28) reduces the question to the case of measures. 

Let us next suppose that G is a Lie group. Like in Chap. XI, n° 25, it is then 
possible to associate to A and yu their convolution product, namely the distribution 


(25.30) A* MI pr—> me p(xy)drX(x)du(y) , 


at least if \ or has compact support. The product of three distributions is as- 
sociative if two of them have compact support. It is even possible to associated 
convolution operators in C'°(G) to every distribution 4 with compact support, 
namely 


(25.31) E(u) f(a) = f f (ye) duly) = 1 fl), 


(25.31’) R(u) f(a) = i fley)duly) = f*u'(2), 


where du’ (x) = du(x~'). Since any locally integrable function f defines a distribu- 
tion p++ f v(x) f(x)dx, w* f and f * can be defined for any distribution j, with 
compact support. But the result may be a genuine distribution, not a function. 
Operators L(y) (resp. R()) clearly commute with right (resp. left) translations. 
If y is the distribution with support {e} defined by some X = y'(0) € G’(e), then 


X «9(g9) = fe (x~*g) dX (x) = Dog [y(t)*9] , 
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whence 

(25.32) X * p(g) = —dp(g; Xg) = L(X) (9). 
Similarly, 

(25.32’) R(X)p(g) = —~ * X(g) = dyl(g; 9X). 


This supposes y € C™(G). 
Since R(X)p =—-y*X, 


R(X)R(Y)p — RIV) R(X)p = (px Y) *& X —(px X)*Y. 


Hence, the left hand side being equal to R([X,Y])p = —y * [X,Y], in terms of 
distributions, 


(25.33) [X,Y] =X*VY-YVX. 


Formulas (32) and (32’) suggest a generalization to distributions: for any dis- 
tribution on G, we will set 


(25.34) L(X)w=Xep, R(X)p=—pxX. 


By (30), X * us is the distribution 


eo ff elevax duty) = faut) f Fledx(@) = 
= [ Doe truant) == f EQ etw)anty). 
Hence, using Schwartz’s notation (y, w) = u(y), we get equivalent formulas 


(25.35) (y, L(X) pm) + (L(X) 9, mw) = 0, (p, X * yp) +(X * Yu) =0 


similar (and for good reason...) to the definition of the derivative of a distribution 
of a variable [Chap. V, formula (35.3)]. It can similarly be shown that 


(25.35’) (9, R(X)u) + (R(X)p,u) =0, (yw X) + (op Xiu) = 0. 
This relation can in particular be Gaaeat : du(g) = f(g)drg where f is a locally 
integrable function, i.e. if (yp, #) = f v(x) f(x)drx for all y € D(G). Formula (17) 


obtained by proving theorem 39 therefore ae that, if the function R(X)f(g) = 
Dof|g.exp(tX )] exists and is continuous, then 


(25.36) jp Reor dg + [fo 6 (g)d-g =0 


for all y € D(G). This means that existence and continuity of the function R(X) f(g) 
imply 


(25.36) R(X)f =—-f*X in the sense of distributions . 


The two possible interpretations of R(X) f are therefore compatible in this case. It 
can similarly be shown that by identifying f with the distribution f(g)dig, existence 
and la continuity of L(X) f(g) = Dof|g.exp(—tX)] imply 


(25.36” ) L(X)f =X *f in the sense of distributions. 
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Incidentally, observe an important consequence of (36). Using the inner product 
on the space L?(G) of the measure dyg, (36) shows that (R(X)y|7)) = (y|R(X)?)) 
for all y, » € D(G). Considered an operator in D(G), iR(X) is therefore symmetric. 
Hence, if (X;) is a basis of G’(e), the operator }> R(X;) is symmetric > 0. In the 
language of PDE theory, this means that it is an elliptic differential order of the 
second order, analogous to the usual Laplacian. 

Linear combinations of distributions X) *...* X,, with X; in G’(e), are precisely 
the distributions with support {e} or, equivalently in any manifold, those for which 
u(f) is a linear combination of partial derivatives of f at e, in every chart at e. 
Definition (30) of a convolution product shows that if X; = yj (0), then X1*...* Xp 
is a linear functional 


pr Dy... Dpy [11 (t1) --- Yp (te) 


the derivatives D; with respect to t; being calculated at 0. 

The set U/(g) of these distributions is an infinite dimensional associative algebra 
generated by X € G'(e). It is the universal enveloping algebra of the complex 
Lie algebra'®’ g, systematically used from the start by Harish-Chandra, who was 
probably inspired by Chevalley. Associating the operator L(y) (resp. R(j)) defined 
in (34) to each yp € U(g) gives all the differential operators on G commuting with 
the translations, because such an operator L defines a distribution y +> Ly(e) 
with support {e} which it is fully determined by due to its invariance. So U(g) can 
be identified with the algebra of left (resp. right) invariant differential operators, 
which I will instead denote by U(G). For all p, the set U,(G) of operators of order 
< p is finite-dimensional. 

The definition of convolution products readily shows that, for all X € G'(e) 
and alla €éG, 


aX =€a*X, Xa=X *€q 


provided tangent vectors aX and Xa to G at a are identified with distributions 
having support {a}. This gives the formula 


ad(a)X = €a * X * €g-1. 


A similar formula allows ad(a) to be defined for all distributions on G. It is the 
image of j: under the automorphism x +> axa~' of G. If y is of the form f(x)dz, 
ad(a)1 is defined by the function f(a~'axa) which we could denote by ad(a) f(z). 

Central distributions, i.e. invariant under operators ad(a), are easily character- 
ized: they must satisfy 


X *uU= xX 


for all X € G’(e), and this condition is sufficient if G is connected. To see this, we 
compute the differential of the function 


gt (¢,ad(g)) = f 2 (9297) due) 


105 To tell the truth, this how an “abstract” algebra associated to any Lie algebra 
is called. But for Lie groups, it is isomorphic to the one in the text. On this 
topic, see J. Dixmier’s book Algébres enveloppantes (Gauthier-Villars, 1974), or 
N. Bourbaki’s first volume, Groupes et algébres de Lie (the chapter on Lie groups, 
dedicated to infinite-dimensional analytic groups over more general fields than R 
or C, is practically unreadable. The founding members, who had left the group 
when it was written, would never have allowed such a monster... ). 
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for y € D(G). It is possible to differentiate under the f sign and to use the differ- 
entiation formula of a product. So the result is 


Dog [y(t)gxg7'y(t)~*] = Dov [y(t)gxg7*] + Dov [gxg7*(t)~"] 
=-Xx*o (gxg') +p*X (grg"') 
if X = ¥'(0). Hence, if ad(g)u = p, 
(X * 9, u) = (p* X, pH) - 


So by (35) and (36), the condition is necessary. The converse follows from similar 
computations. 

The operators ad(g) are homomorphisms with respect to the convolution prod- 
uct. On the other hand, they map U/,(G) to U,(G) for all p, thus defining linear 
representations of G on the spaces U/,(G). Denoting them by ad,(g), there are 
operators 


adp(X) = ad,(e)X 
corresponding to them. Previous calculations show that 
adp(X)u = X * w— w* X 
like in G’(e). Calculating ad,(v) for an arbitrary distribution v € U(G) is a dif- 
ferent problem altogether, as the reader will convince himself easily. 


If z is a linear representation of G on a finite-dimensional vector space H, as in 
the case of the measure in Chap. XI, the operator 


(25.37) m(p) = / m(g)du(g) 


can be associated to every distribution 4 with compact support. The formula is 
well-defined since m(g) is C°°. The definition of convolution products shows that 


(25.38) m(A * pb) = TA) (pH). 


If y is a vector X € G'(e), we recover 7(X) = 1'(e)X. 


26 — Lie Algebras in Infinite-Dimension 


(i) The subspace H®. It was shown above that every finite-dimensional representa- 
tion (H, 7) of G has an associated Lie algebra representation g obtained by setting 


thy@la-a 
t 


(26.1) m(X)a = Dor [7(t)] a = lim 


for every tangent curve to X at t = 0. In infinite dimension, though the situation 
is not intractable, it is considerably more complicated. It has given rise to some 
difficult theorems’ and entails many pitfalls. 

Let us consider a continuous representation 7 of G on a Banach space H and, 
for a € H, the function p(g) = 7(g)a. Even if derivative (1) exists for all tangent 


106 This section follows closely the more complete though older presentation of 
G. Warner’s in Harmonic Analysis on Semisimple Lie Groups (Springer, 1972), 
vol. I, pp. 252-304. 
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curves y(t) to a given X, it is not a priori obvious that the result only depends on 
X. To avoid this ambiguity, we set 


(26.2) m(X)a = Dor [yx(t)]a 
each time this derivative exists. As 


nlgy@la=z7(g)tlyOla, 


the operator R(X) defined by (25.16) can be applied to the function 7(g)a. The 
result, namely 


(26.3) R(X)r(g)a=x(g)r(X)a, 


proves that g +> R(X)zx(g)a is continuous. Theorem 39 then shows that the ex- 
istence of derivative (2) for all X is necessary and sufficient'”’ for the function 
m(g)a to be of class C'. 

The set H® of a € H such that the function 7(g)a is C™ is, therefore, the set 
of vectors for which the expression 


(26.4) a (X1)...0(Xp)a 


is well-defined for all p and X;. We will denote by 7°(X) the restriction of 7(X) 
to H® for all X. Generalization to distributions uw € U(G) is obvious, with 


n(x v) = 2 (y)n™(v). 
The subspace H® is 7(g)-invariant since 


107 This point is not always properly justified. See for example A. Knapp, Represen- 
tation theory of semisimple groups... (Princeton UP, 1986), proof of lemma 3.13, 
p.55: For each a the map g +> 1r(g)7(X)a is continuous, and thus m(g)a has 
continuous first partials everywhere and must be of class C' (I have changed 
Knapp’s notation) ; N. Wallach, Real Reductive Groups (Academic Press, 1988), 
vol I, pp. 33, lemma 1.6.4, uses the same argument to show that H®™ is complete ; 
Warner, Harmonic Analysis..., vol I, p. 252, Example, also uses it to show that, 
in the regular representation of G on L?(G), the f € D(G) are C™ vectors 
(true, but for other reasons). D. Bump’s arguments, Automorphic Forms and 
Representations, lemma 2.4.2, p.188 are hardly more convincing. 

Knapp’s argument would be correct if the functions R(X)f(g) = 
Dof|g exp(tX)] were, as he says, partials, i.e. partial derivatives of f at g with 
respect to a local chart independent of g. However, as remarked above, they are 
calculated in a chart which depends on it. The whole problem is showing that if 
derivatives 


Dof [exp(Y) exp(tX)] 


exist and are continuous functions of Y € W, then the same is true for genuine 
partials 


Dof [exp(Y +tx)]| 


which Knapp does not write down. 

By the way, note that F. Bruhat, Sur les représentations induites des groupes 
de Lie (Bull. SMF, 84, 1956, pp. 97-205) is the first author to have systematically 
used C'® vectors, distribution theory and that of locally convex spaces to prove 
theorems (irreducibility of “almost all” principal series representations). 
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Dor [7(t)] x(g)a = m(g) Dor [9 *7(t)g] a = 7(g) Dor [A(t] a, 
where 6’(0) = ad(g)~‘y’(0), whence the existence of 
n(X)m(g)a = (q)m [ad(g)"?X] a. 
the result and the formula 
(26.5) n® [ad(g)X] = m(g)n°°(X)a(g)7} 


follow by iteration. We will see in the next section (ii) that operators 7°() can 
also be defined for every distribution with compact support. 
Then, like in finite dimension, 


(26.6) ma ([X, Y]) = [4° (X), er (Y)] in HO, 


because (3) shows first that R(Y)m(g)a = 7(g)7™(Y )a then, applied to 7*°(Y )a, 
that 


R(X)R(Y)r(g)a = R(X)r(g)r™(Y)a = n(g)r (X)r" (Ya. 


This give (6) since R([X, Y]) = [R(X), R(Y)] even when these differential operators 
are applied to functions with values in a Banach space. 


(ii) Weak differentiability and strong differentiability. In finite dimension, a func- 
tion with values in H is checked to be C? by verifying that so are its components 
with respect to some basis. It is then a trivial result applicable to all C° manifolds. 
This result can be generalized here, but less trivially. 

Let H’ denote the topological dual of the Banach space H. It is the set of 
continuous linear functionals on H equipped with the obvious norm. For x € H 
and y € H’, we set (x|y) = y(x). If p is a C® map from G to H, for any x € H’, 
the function 


(26.7) Px(g) = (P(g) |x) 


is clearly in C*(G) since it is the composition of p and a continuous linear map 
from H to C. 

To prove the converse, we first note that every a € H defines a continuous 
linear functional on H’, namely x > (a|x), which gives a canonical map from H 
to H” = (H’)’. It is easily shown (Hahn-Banach) to be isometric. So H is a closed 
subspace of H”. Hence to show that a map with values in H is C™, it suffices to 
show that it is as a map in the dual of H’. Replacing H by H’ reduces the proof to 
the next result whose author is unknown to me,: 


Theorem 40. Let H be a Banach space, G aC®™ manifold and p a map from G to 
the dual H' of H. p is C™ if and only if the function (x|p(g)) is C® for allx € H. 


The question being local, G may be assumed to be open in R”. We first suppose 
that n = 1 and set D = d/dt. By its very definition, the derivative D(x|p(t)) is the 
limit of a sequence of continuous linear functionals in x. Hence it is continuous.!°% 
Thus there is a vector p'(t) € H’ such that 


108 The proof given in Chap. XI, n° 19, (v), lemma 6 for Hilbert spaces readily gener- 
alizes. The Banach-Steinhaus theorem [Chap. XI, n° 15, (ii), note 52] can also be 
used. It shows that if lim(x|an) = f(x) exists for all x € H, then sup |lan|| < -++co 
and as a consequence |f(x)| < M||x||. More generally if M is a subset of H’ on 
which the function y > (x|y) is bounded for all x, then so is the function ||y]|| 
(“any weakly bounded subset is strongly bounded”). These arguments extend 
to more general spaces, for example Fréchet spaces. N. Bourbaki’s book, Espaces 
vectoriels topologiques, is the unavoidable reference for maximum generality and, 
almost always, for clearness and simplicity of proofs . 
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(26.8) D (x|p(t)) = (x|p'(t)) for all x. 


The left hand side being C(G), this result applies to p’. Thus there is a vector 
p(t) € H such that D?(x|p(t)) = (x|p’(t)), ete. 

The left hand side of (8) being bounded on every compact subset of G for all 
x, so is the right hand side, hence so is ||p’(t)|| (Banach-Steinhaus). In 


(xlp(t + h) — p(t) = [ (oclp'(u)) des, 


the right hand side is thus bounded above by M|h>].||x|| for sufficiently small |hl. 
Therefore, 


Ilp(t + h) — p(@)|| = O (lA) 


for sufficiently small |h|. Hence the functions p, p’, p”,... are continuous. 


(x[p(u +h) — p(u) — p'(u)h) = / (axlp'(u +t) — p'(u)) dt 
0 
then likewise shows that 
l|p(u+ A) — p(u) — Ap’(u)|] = off). 


As a result, p’ is the derivative of p in the strong topology, the function p is of class 
C’, hence so are p’,p”,..., proving the theorem for n = 1. 

For arbitrary n, the previous result, applied to the function p(u+tX) for given 
u€G and X € R", first of all shows the existence of the derivative Dop(u + tX) 
with respect to the strong topology. It depends linearly on X since, for all x, the 
differential of (x|p(u)) is linear in X. Setting Dop(u +tX) = p'(u)X defines a 
linear map p’(u) de R” on H’. The function ||p’(u)|| is bounded on every compact 
subset of G since so is the function (x|p'(u)X) = Do(x|p(u + tX)) for all X and x 
(Banach-Steinhaus). For sufficiently small ||hl|, 


(oclp(u +h) — plu)) = | (x|p'(u + th)h) at, 


then shows that ||p(wu + h) — p(u)|| = O(||Al|). Thus p is continuous with respect to 
the strong topology of H’. However, for all X € R” and all x € H, the function 


(x|p'(u)X) = Do (x|p(u + tX)) 


is in C*°(G) since (x|p(u)). Hence, like p, p’ is a continuous function with respect 
to the strong topology and ||p’(u)|| is bounded on every compact set. Writing 


1 
(x|p(u-+h) — plu) p'(u)h) = [ (tp"(u + thyh = p'(u)h) at, 
0 
we deduce that 
(26.9) ||p(u + h) — p(u) — p’(u)h]] = 0 (All) 
This shows that p is differentiable at u and that p'(u) is the tangent linear map to 


p atu. Since p’ is continuous, p is of class C'. Applying these arguments to p’,p”, 
etc., the existence and continuity of all partial derivatives of p follow, qed. 
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Corollary 1. Let p be a map from a manifold G to a Banach space H. p is C@ if 
and only if (p(g)|x) is C® for allx € H’. 


The proof follows by regarding p as a map with values in the dual’? of H’. 


Corollary 2. Let (H,7) be a continuous linear representation of a Lie group G 
on a Banach space H. Then H® is the set of a€ H such that (m(g)al|x) is C™ for 
allx EH’. 


Apply corollary 1 to p(g) = m(g)a. 


(iii) Convolution operators on H®. Let us first show how for any manifold, C° 


functions with values in a Banach space H can be “integrated” with respect to a 
distribution js with compact support: it suffices to define u(p) by the condition 


(26.10) (ule)l«) = | (r(a)lx) ata) 


for all x € H’. To prove the existence of a vector pu(p) satisfying this condition, the 
easiest is to write js as (25.28). The right hand side of (10) then equals 


D [ev dua). 


As the function L;p(g) is continuous, the proof reduces to integrating a continuous 
function with values in H with respect to a genuine measure with compact support: 
Chap. XI, n° 4, (i). Hence 


/ p(g)du(g) = >> / Lip(g).dpi(g) - 


G denoting again a Lie group and (H, 7) a continuous representation on a Banach 
space, one can associate an operator 7° (1) defined by 


T™ (wa = [roadu(s) for all ac H™, 


ie. by 


108 At first glance, the theorem appears to follow from its corollary. It is not so: 
applied to H’, the corollary would require that (x|p(g)) is C™ for all x € H”, 
which is impossible to check in general if H” 4 H. Incidentally, the following 
mistake should be avoided. If 7 is a (continuous) representation from a group G 
to H and if, for any continuous operator T :H +> H, T’ : H’ +> H’ denotes the 
transpose operator, the “contragredient” map g+—> m(g~*)’ is a representation 
of G on H’. But in general, it is not continuous with respect to the strong 
topology. Counterexample: G = R, H = Li(R), nm being the representation by 
translations. Denoting by jf: the function x > f(a — t), strong continuity of 
tt fe for some f € L™(G) means that || ft — fll. tends to 0 as t tends to 0. 
If f is continuous, this requires f to be uniformly continuous on R. 

N. Bourbaki’s booklet, Variétés différentiables et analytiques, 2.6.2, mentions 
in two lines the corollary for every almost complete locally convex space H, 
i.e. in which every closed and bounded set is complete (a set B is bounded if 
all continuous seminorms on B are bounded). This general result can easily be 
deduced from the corollary by observing that every continuous seminorm on H. 
defines a continuous map from H to a Banach space. 
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(26.11) (x (u)alx) = / ((g)a|x) du(g) for all x € H’ 
to every distribution with compact support p. For g € Gand x€ H’, 
(n(g)n* (u)alx) = (x (w)alr(g)'x) = 


= [¢ (y)alm(g)'x) duly )= | joualx) duly) - 


This is the convolution of a C® function and a distribution with compact support. 
The result is C*°. So 7° (4) maps H® to H®. For alla € H®, 1° (1)a belongs to 
the closed subspace H(a) generated by the 7(g)a, because every x € H’ orthogonal 
to these is orthogonal to 1°°(j)a by (11). If H is infinite-dimensional, the vectors 
m*(u)a associated to uw € U(G) are not necessarily everywhere dense in H(a). 
Counterexample: take G = R, H = L?(G) and a function a € D(G). 

Calculations used for measures [Chap. XI, n° 25, (i)] also easily show that 
mw (A)a™ (pw) = 7° (Axjs). Hence in H™ we get all the noncommutative polynomials 
in 7°(X) and m(1) associated to measures with compact support. Generalizing 
definition (25.31’) of R(j) in an obvious manner to all functions with values in H, 
relations (25.34) and (25.36) imply 


(26.12) R(u)r(g)a = 7(g)r™ (Ja, 
(26.13) mn [ad(g)u] = 1(g)n™(n)m(g)*. 


H™ can be equipped with a topology using seminorms 


Pula) = |r" (H)all 


where 4s € U(G), distributions with support {e}. Until further notice, we will only 
consider these. We could even confine ourselves to measures js such as 


b= X1¥*...* Xp 


where all X; belong to a given basis of G’(e). As the function ||7(g)|| is bounded 
on every compact set, relation lima, = a means that the functions R(j)7(g)an, 
or the successive derivatives of 7(g)an, in any chart, converge uniformly on every 
compact set. The topology of H® is thus identical to that of Schwartz on the space 
C™(G;H) of C° maps with values in H. Applying definitions, it is not very difficult 
to show that (i) H® iscomplete (hence is a Fréchet space, because its topology is 
defined by a countable family of seminorms; see Dieudonné, XII), (ii) for every 
distribution js with compact support, the operator 7°°(y) is continuous on H®, 
(iii) for alla € H®, the map g +> z(g)a from G to H® is C*° with respect to the 
topology of H®. 


(iv) The Dixmier-Malliavin theorem. Let us first show that H® is everywhere 
dense in H, otherwise previous considerations would be devoid of interest. The 
proof is the same, slightly less trivial, than the one given above in finite dimension. 
Indeed every y € D(G) defines a continuous operator 


on H, and 


482 XII — The Garden of Modular Delights 


This convolution product is C® since it is the integral of a continuous function 
with compact support which depends in a C’° manner on the parameter g. The 
fact that its values are in a Banach space is not a problem. Hence m(y)a € H®, 
and as y € D(G) enables us to construct Dirac sequences (yn), every a € H is 
the limit of vectors belonging to H™, namely the vectors 7(yn)a [Chap. XI, n° 25, 
(v)], qed. Besides, clearly 


nm (u)m(p) = (UY) 


for every distribution with compact support. 
This result, spectacular for its time but easy,!!° has been greatly improved on: 


Theorem 41. For any representation (7,H) of a Lie group G on a Banach space, 
the Garding subspace is identical to the subspace H™: everya € H®™ is of the form 


a= (pi)ait+...+7(Yp) ap 
with a, €E H™ and yi € D(G) for alli. 


For all i, the supports of y; can also be required to be in a given arbitrary 
neighbourhood of e. Inspired by methods from PDE theory, Dixmier’s and Malli- 
avin’s very ingenious ten page proof requires little knowledge and even covers the 
case of representations on Fréchet spaces. The first step consists in proving the next 
result, starting with G = R: 


Theorem 42. [If G is a Lie group, any function p € D(G) is a finite sum of 
convolutions p *w where p,w € D(G). 


G being assumed to be unimodular, we consider a discrete subgroup I" of G 
(possibly reduced to {e}) and the representation R of G on H = L?(I’\G) with 
1 <p < +00, given by right translations.''’ C°° vectors are functions f for which 
the inner product 


(26.14) (R(x) f\g) = ae f(yx)g(y)dy 


is C™ for all g € L1(I'\G). Hence D(I'\G) C H®. For y € D(G), R(y)f = fy’ 
where y' (x) = y(x~1). By the Dixmier-Malliavin theorem, H® is therefore the set 
of sums of products f * y with y € D(G) and f € L?(I’\G). Thus 


DANO” Cras 


If u is a distribution with compact support, by (11), the operator R°(j:) asso- 
ciated to representation R is given by 


(R™(u) fla) = L (R(e)flg) du(a) = [ dul) if __ funatwiey 


110 Tn Nancy, in 1947, Schwartz and I were rather surprised, not to say upset, when 
Lars Garding published it. We could have proved it “in less than fifteen minutes” 
if only we had thought of it. Garding’s reputation is based on far harder successes 
in PDE theory. For the next two theorems see J. Dixmier and P. Malliavin, 
Factorisations de fonctions et de vecteurs indéfiniment différentiables (Bull. Sc. 
Math., 102, 1978, pp. 305-330). 

111 The case p = +00 needs to be excluded because the representation of G on L® 
is not continuous with respect to the strong topology. 
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for all f € H™ and all g € L*(G). If g € L(I\G), differentiation under the f{ 
sign of the integral over I'\G is possible since f is C°. (25.28) then shows that 


(R*(u)flg) = (f * #'|g) where dyu’(g) = du(g~*). As a result, 
(26.15) R°(u)f =fxp' forall feHu™? 


These vectors belong to the closed subspace H(f) generated by all R(x) f. 

Let us give an outline of the proof that conversely all f € C°(I'\G) such that 
f «pu € L?(L\G) for all uw € U(G) are in H®. The proof resembles that of theorem 
40. For given X € G'(e), we set 


R(t) = Riyx(t)] 


to be the right translation operator by yx (t). For all y € D(I’\G), clearly DR(t)y = 
—R(t)(y * X) in the topology of H. Integrating, we deduce that 


IR@)e — ell = Ile * X11, -O(u) 


(AOR +.) (2 ee *). 


Astr > R(t)y is C™, the right hand side tends to —(f|y * X), which implies that 
Do(R(t) fly) = —(flp*X). But formula (25.35) for the distribution du(x) = f(x)dx 
shows that (fly * X) + (f « X|y) = 0. Hence Do(R(t) fly) = (f * X|y) for all 
yp € D(I\G) and more generally D(R(t)f|y) = (R(t) f * X|y). Since f * X € L?, 
iterating is possible. Thus D?(R(t) f|y) = (R(t) f * X *X |p) is a continuous function 
of t. As a result, 


However, 


(ROF—f4tF* Xly) = [oan Pex v) f+ X* X|y) dv 


for all y € D(I\G). The function being integrated being bounded above by ||f * 
X *« X||p||y||q for all v, it follows that 


|R()f— f+tf«X||, =If*X*X|,0(#) , 
proving the existence of 
DoR(t)f =—f*X 


in the topology of L?. The operator 7(X) of the general theory, defined by (2) when 
the relation is well-defined, is thus defined at f. But as f* € L?(I'\G) is assumed 
to hold for all 4 € U(G), all functions 7(X1)...a7(Xn)f are clearly defined, whence 
f € H°. In conclusion: 


Theorem 43. Let I’ be a discrete subgroup''? of a Lie group G and R the repre- 
sentation of G on H = L?(I\G), 1 < p< +oo. Then H®™ is the set of functions 
such that 

f*xuwEH forall weu(G) 
and then R®(w)f = f xp’. 


112 This assumption is obviously far too restrictive. 
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(v) Analytic vectors. Thanks to the exponential map, every Lie group G can be 
equipped with an analytic structure. In a sufficiently small neighbourhood V of 0, 
there is indeed a C™ function H(X,Y) € G’(e) such that 


exp(X) exp(Y) = exp (H(X,Y)) « 
The Campbell-Hausdorff formula 


HOY SX FY 4 ey o4 
+X, [X, Y]]/12 + [YY X]J/12 - 
—[X, [Y,[X, Y]]]/244+... 


enables us to write H(X,Y) as a series )> H, (X,Y) having homogeneous functions 
of degree n in X and Y as its terms and obtained by applying products of n 
operators of the form ad(X) or ad(Y) to X or Y. The formula is the same for all 
Lie groups and converges for sufficiently small X, Y. The analyticity of H(X, Y) is 
a consequence of this,’!? and all C’° (or even continuous) homomorphisms of Lie 
groups are analytic. 

This raises the question of whether in every representation (H,7) of G there 
are vectors a € H such that g +> a(g)a is analytic.''* The answer is that the 
set H” of these vectors is everywhere dense in H. This difficult result obtained by 
Harish-Chandra for semisimple groups was one of his first successes. P. Cartier and 
J. Dixmier generalized it to almost all groups, but like HC using structure theorems. 
The general case, due to E. Nelson, only uses techniques from partial differential 
equations (heat equation) and has given rise to surprising theorems.!!® 

The operator 7° (ju) preserves H” if w € U(G) (but not if pw is a distribution 
with an arbitrary compact support). Then let M be a closed invariant subspace of 
H. Considering the restrictions of 7(g) to M, we get a representation of Gon M 
for which M* = MNH”*. These subspace is stable under operators 7(X). 

Conversely, if € is a stable subspace of H” under the operators x(X), the closure 
M of E inH is 1(g)-invariant if G is connected. It suffices (Hahn-Banach) to show 
that every x € H’ orthogonal to 7*°(y)a for all 4 € U(G) is also orthogonal to 
all z(g)a. It is, however, obvious that this condition is equivalent to all derivatives 
of the function (m(g)a,x) being zero at the origin. As the latter is analytic, it is 
zero on the connected component of e, whence the result.'!® In particular, for all 
a € H”, the closed subspace generated by all 7(g)a is the closure of the subspace 
of m(u)a, pw € U(G). 

When G is semisimple with finite centre, there are always compact subgroups 
K in G (the pairwise conjugate, maximal compact subgroups) with the following 
property:1!” if (H, 7) is a reasonable (for example unitary) irreducible representa- 
tion of G, and if the restriction of 7 to K is decomposed [Chap. XI, n° 29, (iii)] into 
subspaces H(y) corresponding to the characters of the irreducible representations 
of K, then dim H(x) < +00. 


"13 Detailed presentation in my Introduction 4 la théorie des groupes de Lie (Publ. 
Math. Université Paris VII, 1982 or Springer, 2003), § 6. 

114 Theorem 40 remains valid for analytic maps on a Banach space. See Browder, 
Amer. J. Math., 84, 1962, beginning of pp. 666-710, or Serge Lang, SL2(R), 
appendix 5, § 2. 

15 P. Cartier and J. Dixmier, Amer. J. Math., 80, 1958, pp. 131-145; R. Nelson, 
Annals of Math., 70, 1959, pp.572-614. See also Lars Garding, Bull. SMF, 88, 
1960, pp. 73-93. Warner’s book presents most of these results. 

116 There is no analogous result for H®. 

"” For a fairly simple proof of this result of Harish-Chandra, see R. Godement, 
A Theory of Spherical Functions (Trans. AMS, 73, 1952, pp. 496-556). 
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Theorem 44. Let G be a Lie group, K a compact subgroup of G and (H,7) a con- 
tinuous representation of G on a Banach space H. Suppose that, for some character 
x of K, the subspace H(x) is finite-dimensional. Then all a € H(x) are analytic. 


(a) Let us first show that a € H(x) is C®. To this end, we consider the subspace 
of m(y)a where y € D(G). The operator 


Boj = a= / (k) x(k) dk 


projects H onto H(x) (at least if x is normalized in such a way that y*x = x). Since 
E(x)a = a, a is the limit of vectors of type E(x)m(y)a = m(X)7(y)a = 7(X * v)a. 
The set of these vectors is a vector subspace of H(x). Hence, since dim H(x) < +00, 
there exists y € D(G) such that a = 7(X * y)a. Thus ac H®™. 

(b) Let us show that there is a basis (X;) of G’(e) such that the element A = 
>> X? of U(G) is adg(k)-invariant for all k € K. For this note that, K being 
compact, there is an ad(k)-invariant positive definite quadratic form on G’(e). Every 
orthonormal basis with respect to this quadratic form is suitable. Indeed there are 
equalities ad(k)X; = >> a?(k)Xp where the matrix of a? is orthogonal, whence 
adz(k)A = So, a?(k)a%(K) XpXq— A. 

(c) As 7°(A) commutes with all m(k), m°(A)H(x) C H(x). Since H(x) is 
finite-dimensional, there is a basis for H(x) all of whose elements satisfy a relation 
of the form [7°°(A) — A]?a = 0 for some X € C and some integer p > 1. For 
all x € H’, the function f(g) = (7(g)a,x) then satisfies the differential equation 
[R(A) — A]Pf = 0. However, in the previous n°, using (25.35’), the differential 
operator R(A) was found to be elliptic (and obviously with analytic coefficients). 
PDE specialists will explain that then the same is true for [R(A) — A]? and that 
every solution (function or distribution) of an elliptic PDE with analytic coefficients 
is an analytic function. The function (z(g)a,x) is therefore analytic for all x € H, 
hence so are a and as a result so are all vectors in H(x), qed. 


(vi) The case of unitary representations. All of the above applies to what is the 
most important case, that of unitary representations. As H’ = H, corollary 2 of 
theorem 40 immediately gives the characterization of C™ vectors in H. 

For a,a’ € H™ and any distribution p with compact support, 


(26.16) (x (u)ala’) = (aln™ (ji)a’) 

like in the case of measures and by the same calculations, and in particular 
(26.16’) (°° (X ala’) + (alr°(X)a’) =0 

for all X € G'(e), generalizing (25.36). So the operator im™(X) is symmetric on 
H®~. The next result can then be deduced: 


Theorem 45. Let (H,7) be an irreducible unitary representation of G and w a 
distribution with compact support. If 1 is central, the operator 7° (1) ts a scalar. 


For all scalars A, the operator 
S = [x°(@) + Al] [x (wu) + AI] s HO — HO 


is positive symmetric by (16) and commutes with all m(g), hence is a scalar 
[Chap. XI, n° 23, (ii), Schur’s lemma II: use the self-adjoint canonical extension 
of S]. As this is the case for \ = 0, the operator Aw (fi) + Aw™ (2) is a scalar for 
all A, for example for \ = +1, qed. 
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Let us show that, in every unitary representation, the operator ir(X), defined 
by (2) and extending in™(X), is self-adjoint. As 7(X)a is the derivative of the 
function z[yx(t)]a at t = 0 when it exists, it suffices to prove it for G = R. 
Proposition (ii) of the next theorem will be sufficient. 


Theorem 46 (M. H. Stone). Let t+— U(t) be a unitary representation of R on 
a Hilbert space H. 
(i) There is a spectral measure M on R such that 


(26.17) u(t) = / epOOAMOY forall ts 


(ii) DoU(t)a exists if and only if a belongs to the domain of definition of the 
self-adjoint operator 


(26.18) a= / AdM(d); 
then 
(26.19) Dy (UG a) Sata: 


(iii) The function U(t)a is C™ if and only if 


a € ( |Def (H”) . 


Proposition (i) holds for every commutative Icg. It can be obtained directly 
from the spectral theory of u Chap. XI, n° 22 applied to the GN algebra generated 
by operators U(f) = f f(t)dt where f € L*. Like F. Riesz for G = R, one can also 
deduce Bochner’s theorem [Chap. XI, n° 30, (iii), exercise 4]. 

To prove point (ii), use spectral measures djig,y(A) associated to M. Hence 


(U(t)alb) = [explirt)duas(), 


As was seen in n° 23 of Chap. XI, the subspace Def(#H) is the set of a € H such 
that 


(26.20) J Xauoa0) < +00. 
Then the value of the left hand side is ||Hal|? and 
(Haj) =f Adjio.n(2) 


for all x € H, the integral being convergent. 
Calculation rules of Chap. XI, n° 22, (ii) then show that, for all a € Def(H), 


=) 


exp(it) —1—it= if [exp(tu) — 1] du 


2 


exp(iAt) — 1 Dl ateesONs 


t 


(26.21) a—iHa 


eee 
t 


But equality 
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shows that the function integrated in (21) is dominated by the integrable function 
4). Thus the left hand side tends to 0 as t tends to 0 if (20) holds. Therefore, the 
derivative DoU(t)a = ia’ exists for all a € Def(H), with a’ = Ha. 

Conversely let us suppose it exists. For all x € H, 


(x|ia’) = lim (x| (U(t)a — a) /t) = lim ((U(—t)x — x) /tla) 


since U(¢) is unitary. If x € Def(H), the last term tends to (—iHx|a) by what has 
been proved above. Hence 


(x|ia’) = —(iHx|a) for all x € Def(H). 


As a result, the adjoint H* of H is defined at a and H*a = ia’. But iff is self- 
adjoint. So a € Def(H) and ia’ = Ha. 
Finally, let us suppose that U(t)a soit C™°. Setting D = d/dt, 


iU(t)Ha = U(t)DoU(t)a = DU(t)a. 


This is a C™ function of t. As a consequence, Ha € Def(H) and a € Def(H”). 
Iterating the calculation gives a € Def(H”) for all n. Conversely, we suppose a € 
Def(H”) for all n. As a € Def(H), the function U(t)a is differentiable and has 
derivative iU(t)Ha. As Ha € Def(H), the result can be applied to Ha. So 


D°U(t)a = —U(t)H’a 


exists, etc, proving point (iii) of Stone’s theorem. 
Observe that conversely, any self-adjoint operator 


H= / \dM(A) 


arises from a unitary representation 
U(t) = [expitx)ar) 


of R. We set U(t) = exp(itH), not to be confused with the series, which is not 
well-defined. 

Exercise 5. Suppose G = R. Show that, for the regular representation of G, 
H° is the space of functions f whose Fourier transforms satisfy [ |f(A)|292"dd ze 
+oo for all n. These functions are C’®. All their derivatives are in L* and have 
integrable Fourier transforms. Interpret the Dixmier-Malliavin theorem. 

Exercise 6. Let U(t) = fe(At)dM(A) be a unitary representation of R. Show 
that, for any compact set w C R, the elements of the spectral manifold H(w) are 
analytic and the infinitesimal generator H = [ AdM(A) is defined and continuous 
on H(w). Show that the operator A= fexp(—7\*)dM(A) maps H to the subspace 
of analytic vectors of H. 

Exercise 7. Consider the group, named after Heisenberg, of real matrices 


Ka) 

II 
oor 
Ors 
rer 


(i) Show that there is a basis (X,Y, Z) for G’(e) such that 
[X,Z]=[Y,Z]=0, [X,Y] =Z. 
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(ii) In H = L?(R), consider operators 
m(g) f(t) = ela(z + ty)| f(t+2), 


where a € R is given and non-zero. Show that 7 is an irreducible unitary repre- 
sentation of G and that, for all y € L(G), 7(v) is a Hilbert-Schmidt operator.1!® 
(iii) Show that H°° = S(R), the Schwartz space (Chap. VII, n° 31) of rapidly 
decreasing C® functions and of their successive derivatives. 

Exercise 8. Let (H,7) be a unitary representation of a connected Lie group and 
A:H® —+ H® acontinuous operator with respect to the topology of H® defined 
at the end of section (iii) (A is not assumed to be defined on all of 1). Show that, 
like in finite dimension, 


(26.22) An(g) =1(g)A An (X) = 2° (X)A. 


27 — Differential Operators on SL2(R) 


From now on, nr 15 and 16 of this chapter will be frequently used. Almost all of 
this end of chapter is taken from a course I gave in Paris in 1971-1972. 


(i) The Lie algebra of SL2(R). As the space of real matrices with zero trace 
has dimension 3, for example the matrices 


ay x=). 69). 1-03) 


form a basis for the Lie algebra G’(e) of G = SL2(R). X, H and Y are the tangent 
vectors at t = 0 to the one-parameter subgroups 


(27.2) a) = € i) » A) = G - » WO= (; ) 


already extensively used with a different parametrization for h(t). Hence 
27.3 a(t) =exp(tX), h(t)=exp(tH), y(t) =exp(ty). 
These subgroups act on the upper half-plane P by 

27.4 a(t)z=z+t, h(t)ze=e™*z, y(t)z=1/(tz+1). 


The commutation formulas 


27.5 (H,X]=2X, [H,Y]=-2Y, [X,Y]=H 


occur in every semisimple groups. 
The tangent vector to the one-parameter subgroup of orthogonal matrices at 
the origin will also be needed. Differentiating the matrix 


cos27t sin 2at 
(27.6) k(t) = 6 sin2mt cos sg % 


we get the matrix 
"8 Plancherel’s formula for this group will be found in my mémoire du Journal de 


Liouville (XXX, 1951, pp. 92-101). All this was generalized to unipotent groups 
by J. Dixmier, A. Kirillov, etc. 
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01 
—1 0 


O -1 


(27.7) A=2n ( ) =27iW, where W = € i) =i(Y —X). 


W is in the complex Lie algebra g of G. Hence 

k(t) = exp(tA) = exp(27itW) . 
The operator R(W) corresponding to W is defined by 
(27.8) aniR(W) f(g) = Dof [ak(t)] - 


For every function f € Fee = F,NC°(G), ie. every C™ solution of the functional 
equation 


Ff [gk(t)] = fla)xr(t) = f(g) exp(2nirt) , 


(27.9) RW)f=rf 


and this property characterizes f € Fre. 
Together with W, the complex matrices 


Z=2 (; 5) =1(H+2X4+W), 
(27.10) axe € 7) =i(H-2%X-W), 


form a basis for g and once again 


27.11) [W,Z])=2Z, [W,Z])=-227, [Z,Z]=W. 

These formulas being valid for all corresponding operators, if f € Ff°, then 
27.12’) R(W)R(Z)f = 2R(Z)f + R(Z)R(W)f = (r +2)R(Z)f, 

oF 12") R(W)R(Z) f =-2R(Z) f+ R(Z) R(W)f = (r —2)R(Z) f. 


Setting as we will generally do to simplify notations 


27.13) Mf=R(M)f=-f*M, MNf=R(M)R(IN)f, etc. 
for all M,N € g and all function f on G, these results can be presented in a diagram 


lore) 
r+2 


Ww 
(27.14) Fe a 


which is going to play an important role. 
Using convention (13) and commutation formulas, we get 
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[W?,Z] =2.WZ+ZW); [W?,Z] =-2(WZ+ZW) , 
[ZZ,W] = |ZZ,w] =0, 
[27Z,Z] =-WZ, [22,7] =Zw, 
[2Z,Z])=-ZW, [2Z,Z])=wZ. 


These are calculations in U/(G), and not of matrix products. Hence 


[22+ 27,2) =-5[W,Z], (22+22,Z7]=-5[w,Z| 
Thus the element 
(27.15) Ge (ZZ + 22) +W?/4= 27 +W?2/4—W/2 


of U(G) commutes with W, Z and Z. It is the Casimir operator''® of G. As it 
belongs to the centre of U/(G), the differential operator R(Q) commutes with left 
and right translations. 


(ii) Differential operators on the half-plane. In n° 15, (ii), a function 


f-(g)=(ci+ df), z= 


belonging to the space F,-° of functions of weight r on G was associated to every 
C@® function f(z) on the half plane P and to every r € Z. Letting operators Z 
and Z act on f,, we get functions in F2{. and F22. to which functions on P are 
therefore inversely associated. They will be denoted by Z,f and Z,f: 


(27.16) g9=24,f Gr+2 =Zfr, g Lng Gr—2 Zfr . 


These operators arise from right translations on G, hence commute with left trans- 
lations. However, these are the operators L(g) of n° 15, (ii) acting on the half-plane 
P. Hence 


ZrLp(g) = Lrt2(g)Zr42, ZrLr(g) = Lr—2(g)Zr-2.- 


An immediate consequence of these relations is that, if I is a discrete subgroup of 
G, the operator Z, (resp. Z,) transforms every generalized automorphic form? of 
weight r for I’ into a generalized automorphic form of weight r+ 2 (resp. r — 2) for 
rT. 


To express these operators explicitly, we start from the relation 


119 Tf g is a Lie algebra, the bilinear form K(X,Y) = Triad(X)ad(Y)] is invariant 
under the adjoint representation. By definition, g (or G) is semisimple if K is 
non-degenerate. If (X;) and (Y‘) are bases for g such that K(Xi,Y’) = 8, 
the element Q = X;iY" is in the centre of the enveloping algebra and does not 
depend on the chosen basis. It the Casimir operator of g, invented in the 1930s 
by a physicist who later became the technical director of Philips. H.B. Casimir 
and B.L. van der Waerden used it at the time to prove purely algebraically 
that all finite-dimensional representations of g are direct sums of irreducible 
representations. 

120 i@., recall that these are the C®° solutions of f(yz) = (cz + d)" f(z) and are not 
necessarily holomorphic. But operators Z, do not preserve holomorphy. 
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1 med 
(27.17) Fla + iy) = 7" fe(9) ro=(" vi)ee 
y 


of n° 15, (ii). Hence calculations for this value of g are sufficient. As the action is 
on functions f(z) that actually are functions f(z, y), since they are not necessarily 
holomorphic, in what follows we will set 


D, =d/dx, Dz =d/dy, 
(27.18) D=D2+id, = 2i0/0z, D= Dz —iD, = —2i0/02. 


So holomorphic functions are solutions of Df = 0. 
This being so, 


2Z=H+W42X, 2F%=H-W-2xX 


and R(W) = r in F,. Computing the real operators R(X) and R(#) using the 
general definition (25.9) is therefore sufficient. 

As X = 2'(0), differentiating the function t > f,-[gx(t)] = y’/? f(a + ty + iy) 
at t =0, R(X)f,(g) = X fr(g). Thus 


(27.19) Xf aye? A Dis): 


For H =h'(0), fr[gh(t)] = (e?*y)"/? f(a + iey), whence 


(27.20) Hf-(g) = ry"? f(z) + 2y°?/? Do f(z). 
Finally, Wf-(g) = rfr(g) = ry"/? f(z). So 
27.21) 2Z fr(g) = 2ry"/? f(z) + 2y°t?/? D f(z), 
27.21’) OZ fig) = 2p" Diz): 


and using (17) for r and r + 2, we finally get 
27.22) Zr f(z) = (ry? + D) f(z), 
D799") Zrf(z) =y?Df(z). 


The second relation proves that, for every function y € C™(G), 


27.23) peur = Wyery & Zp=0. 


Operators (22) and (22’) were introduced by Maa in 1953 in the context of 
modular functions without any reference to Lie groups, but they already appeared 
in Bargmann (1947) for the unit disc, which comes to the same. 

Setting 


(27.24) A=yDD=y’ (d’/dx? +d? /dy’) , 
easy calculations show that 
(27.25) Zp-2Z, = A+ryD. 


Similarly, using (24) and [Z, Z] = W, we get 
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(27.24’) Zr+2Zr = A+ryD—r. 


A differential operator of the second order Q commuting with W, Z and Z 
was introduced in (15). Since it in particular commutes with W, it acts on each 
space F;°(G), hence on functions f(z) on the upper half-plane. Since it commutes 
with left translations, it also acts on F-°(I"\G). For given integer r, associating the 
corresponding function f,(g) to a function f(z), @ transforms it into a function 
which, by the same formula, will define a function on the half-plane which we will 
write 2, f(z). Since 22 commutes with left translations, 2, commutes with operators 
L,(g) of n° 15. 

To compute (2, explicitly, we write 2 as 


(27.26) Q=ZZ+W?/4—-W/2. 


Then, since W =r on FY, 


rr = 
(27.27) 2, fz) = 5 ( ; 1) f(z) + Zr—-aZrf (2). 
Taking (24) into account, it follows that 
rr = 
(27.28) 2, = 5 (; 1) bryD+A. 


In particular, Q9 = A, the invariant Laplacian of the half-plane, which commutes 
with operators Lo(g) f(z) = f(g~'z). In conclusion: 


Theorem 47. Let I’ be a discrete subgroup of G and f a generalized automorphic 
form of weight r for I. Then the functions 


Zrf(z) =rf(zy* + Df(z), 


Zrf(2) =y'Df(z), 


= r 
2, f(z) = Af) +ryDI@) + 5 (F -1) £@ 
are generalized automorphic form of weight r+2,r—2 andr for I. 


Like Maa in 1953, these results could be obtained by direct calculations. Half 
a century of experience shows that “elementary” and “explicit” calculations gen- 
erally become inextricable’?! beyond groups of dimension 3 or 4 and that by dis- 
regarding the impressive technique of semisimple groups, we become cut off from 
fundamental ideas. Besides, even in the present case, checking “without knowing 
anything” that operators Z, and Z, act on automorphic forms is an exercise in 
which it is easy to get lost and which is omitted by all authors I am acquainted 
with. So let us indicate how to proceed for Z,, the other case being left to the 
reader. 

Starting from the equality f(yz) = (cz + d)" f(z), we get 


D[f(y2)| = (cz + d)"Df(z) 


since (cz + d)" is holomorphic. As D is a differentiation operator, the chain rule 
shows that!” 


21 This can be realized by reading H. Maa8, Lectures on Siegel’s modular functions 
(Tata Institute, 1954-1955), § 15. 

122 Recall that a notation such as Df (yz) denotes the value of the function Df(z) 
at yz, whereas the notation D|f(yz)] denotes the effect of D on the function 


ze> f(yz). 
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D[f(y2)] = Dif (7z).D [Re(qz)] + Dof(7z)-D [Im(7z)] . 
However, D(z) = 0 and so D[Re(yz)] = —iD[Im(yz)], whence 
D[f(q2)| = Df(yz).D [Im(7z)] = Df (yz).D Gz — 72) /21 = 
—Df (yz)-D (92) /24 = Df (yz). (+4) 


Thus 
Df (yz) = (cz +d)? (cz +d)" Df (z). 
It follows that 
ZF (yz) = Im(yz)"Df (yz) = y"|cz + dl~* (cz +d)? (cz + d)"Df(z) = 
= (cz +d)” *Z,f(z) 
as predicted. 


The simplest eigenfunction of A is f(z) = y*, which satisfies 


27.29) A (y*) = s(s—1)y*. 
Hence, for the corresponding function 

27.30) fo(g) = a(g)* 
of weight 0, 

27.29’) Qfo = s(s —1)fo. 


As the Casimir operator commutes with left and right translations, the Maa8 series 
27.31) M(z;s8) = be Im(yz)* 


also satisfies (29) when it converges and even, by analytic extension, when it does 
not. 

To get eigenfunctions of 2, for any even r, it suffices to repeatedly apply op- 
erators Z and Z on fo. Due to diagram (14), this amounts to applying operators 


Zop—2...Z2Zo and Zop-2...Z2Z on y*. We find 


(27.32) Zof(z) =sy* ', ZeZof(z) =s(s+1)y*? 

and 

(27.32) Zof(z) = sy°t’, ZeZof(z) =s(s+1)y*t?,.... 
s—k 


As we get the functions y°~", up to a constant factor (sometimes zero...), the 
operators used transform the first series (31) of weight 0 into the series of weight 


2k associated to y°—*, namely 


(27.33) Mox(z; 8) = S_ J(y32)7** Im(yz)** (Kk > 0), 


and the second one into the analogous series of weight —2k. The corresponding func- 
tions on G are therefore all eigenfunctions of (2. We will return to these calculations 
in n° 29. 

Exercise 1. How should the previous calculations be modified so as to obtain 
eigenfunctions of odd weight? 
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28 — The Representation of g Associated to a Holomorphic Function 


(i) The g-module HC;,(f) = HC(f,). For a given integer r, let us start from a non- 
trivial holomorphic __ function on the half-plane and associate the function f, € H, 
on G to it. Hence Zf, = 0, Wf, = rfr. We show that 


(28.1) ZZ" fr =dnZ"" fr 
with scalars An € C. For n = 0, this is obvious, with Ao = 0. For n > 0, we write 
ZZ" = 222" =([Z, Z| Z" 14+ 222" =-W2"*4277"". 


Hence, supposing that (1) holds for n — 1, the induction relation An = An-1 — (r + 
2n — 2) proves it for n. As Ao = 0, 


An = —n(n+r—1). 
Therefore, setting 
(28.2) Grin = Z"f, forn>0, 
we get the following formulas: 
ZGOr+2n = Qr+2n+2 
W gr+an = (r + 2n)gr+an 
(28.3) ZGrt2n = —n(r +n —1)gr+on—2- 


HC(f,) or HC;(f) will denote the set of finite linear combinations of the functions 
Jr+2n- It is stable under operators of g and so under R(y), w € U(g). 


Since 
(28.4’) ZZ grtan = —n(r +n—1)gr+42n 
(28.4”) ZZGrton = —(n+1)(r+n)gr+2n 


follow from (3), 
(28.5) Qgrtan = 8(8—1)gr42n with s=r/2. 


So the Casimir operator reduces to a scalar in HC,(f). 
For r > 0, these formulas can be simplified by setting 


(28.6) Cr=fr, Crt2n= gr+an[ T(r + 1)...(r+n—1) forn>1. 
Then 


Zem = (s +m/2)em+2 , 


Wem = ™Meém, 


(28.7) Zem = (8 — m/2)em—2 
form = r,r+2,... The vectors em are all 4 0, since the coefficients s — m/2 = 
4(r—m) being 4 0 for m > r, the relation e,, = 0 would imply em_2 =... =er = 0. 


For r > 0, the space HC;,(f) is therefore infinite-dimensional, but ,as we will see, 
not necessarily for r < 0. 

Furthermore, for r > 0, HC,(f) does not contain any non-trivial subspaces 
stable under g. Being stable under W, such a subspace is indeed generated by 
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the vectors em it contains. But formulas (7) with all coefficients # 0, show that 
it then contains all e;,. The Lie algebra g of G therefore acts in an algebraically 
irreducible manner on HC;,(f). So HC;,(f) is a simple U/(g)-module, as one says in 
Algebra. Since only r occurs in formulas (7), the structure of HC;(f) as a g-module 
is independent of the choice of f. We thereby get the following result: 


Theorem 48. Let I be a discrete subgroup of G, f(z) a holomorphic automorphic 
form of weight r > 0 for I and 


y(g) = fr(g) = (ci +d)" f(g) 


the associated function on G. Let HC(y) be the vector space generated by the func- 
tions R(Z)"y. Then the spaces w € HC(y) are left I'-invariant, HC(y) is stable 
under g and a simple g-module. For given r, all the simple g-modules thus obtained 
are isomorphic. 


For parabolic forms, i.e. such that y € L?(I°\G), this result is the infinitesimal 
analogue of theorem 27 of n° 20 which says that the representation of G on the closed 
invariant subspace H(y) of L?(I’\G) generated by ¢ is irreducible. By point (b) in 
the proof of theorem 27, vy is the only element of weight r of H(y), up to a constant 
factor. So by theorem 43, it is an analytic element of the representation R of G on 
H(y) or L?(P\G). For all wp € U(G), R@ (wy = v* pp’ © H(y) by (26.15), whence 
HC(y) C H(y). Moreover, HC(y) is everywhere dense in H(y), because if there 
is some g € H(y) orthogonal to all y * ys, all derivatives of the analytic function 
(R(x)y|g) are zero at the origin. So g is orthogonal to all R(x), hence zero. 

For I = e and r > 2, theorem 47 applies to ¢ € H2(G) and in particular to 
the kernel function w, [n° 16, (iv)]. Transforming the latter by left (resp. right) 
translations generates the closed subspaces H2(G) [resp. H(w,)] of n° 16 and 20 in 
L?(G). As in both cases w,. is an analytic vector (theorem 41), the former (resp. 
latter) space is the closure in L?(G) of the set of functions .*w, (resp. w, * 44) where 


pw €U(G). Since (wr * w) = [i * wr, the map y +> ¢ is a semilinear isomorphism 
from H(w,) onto H2(G) transforming the representation R of G on the former space 
into the representation L on the latter one. The same conclusion follows concerning 
representations of g on these spaces, defined by L(z)f = yw * f in the former case, 
by R(u)f = f *p’ in the latter. Relation R(W)w, = rw, satisfied by all f € H,(G) 
becomes L(W)w, = —rwy, so that ?(G) has a basis consisting of eigenfunctions 
of L(W), the eigenvalues being the integers —r — 2n (n > 0). 


(ii) The case r = —p < 0. Formulas (3) become 


ZG2n—p — 92(n+1)—p 
(28.8) Zgon—p = (p+ 1—1n)gon-1)-p - 


There are then two possible cases. 

If gon-p # O for all n > 0, the space HC(f,-) admits as basis the functions 
9—p; G—p+2)--+>9p,9p+2;--- and is infinite-dimensional. But it is no longer a simple 
g-module. Indeed Zgp+2 = 0 by (8). Thus the subspace F’ = HC(gp+2) generated 
by 9p+2;9p+4,--. is stable and corresponds to case (i) for r = p+ 2. As the Lie 
algebra acts both on HC,(f) and F’, it also acts on the quotient space HC;,(f)/F’ 
which, as a g-module, is simple and of dimension p + 1. 

If on the other hand g2n—-p» = 0 for some n, then, applying Z, gan+2-p = 0 as 
well, so g2m—p = O for all m > n. Hence if n denotes the smallest integer such 
that gon+2-p = 0, whence gon-p # 0, the second relation (8) applied to n + 1 
shows that n = p. So, this case occurs if and only if gp42 = Z?*'f, = 0. The 
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p+1 vectors g—p,g—p+2;---;9p then form a basis for the space HC(f;,), which is 
of dimension p+ 1 and is again a simple g-module (same arguments as above). In 
this case too, formulas (6) can be used since denominators are # 0 for 0 < n < p. 
As s =r/2 = —p/2, we recover (7) in the form 


(28.9) Lem = -50@ + mM)em-—2 - 


(iii) Finite-dimensional simple g-modules. Every simple g-module € of finite 
dimension p+ 1 is easily expressible by formulas (9). For this, observe that W has at 
least one eigenvector a in € and that the relation Wa = Aa implies W Za = (A—2)Za 
because of commutation formulas. As W only has finitely many eigenvalues, there 
exists a £ 0 in € and r € C such that Wa = ra, Za = 0. As above, the subspace 
generated by the Za is stable under g, hence equal to €. Thus the non-zero Z"a = 
Gr+2n form a basis since they belong to eigenvalues r+2n of W . Formulas (3)and (4) 
continue to hold since they are only based on commutation relations. Since € has 
dimension p + 1, as above ay, @r+2,...,@r+2p are non-zero and form a basis for €. 
Hence ZZar+2p = 0, and so r = —p, a negative integer etc. 

The existence of a basis satisfying (9) is uniquely based on commutation rela- 
tions satisfied by W, Z and Z. Hence the arguments could also be applied to the 
operators associated to the matrices H, X and Y defined in (27.6). Hence, if € is 
a simple g-module of dimension p + 1, there is also a basis u_p, u_p+2,...,Up in € 
for which, instead of (9), we have 


a 
Xtim = —5(p— mms 
Hum = Mum , 
1 
(28.10) Yum = 5” + m)Um-2. 


Conversely, these formulas obviously always define a simple g-module of dimension 
pti. 


(iv) Condition for dim HC,(f) < +00. If we knew when the g-module HC,(f) 
of a holomorphic function f has finite dimension p+1, then these calculations would 
be complete. This supposes r = —p and is equivalent to Z?*' f_, = 0, hence to 


(28.11) ZpZp-2-.. Zp f(z) =0. 


Hence this product of differential operators has to be computed taking into account 
definition 7, f = ny~'f + Df, where D = d/dy + id/dx. We are going to do this 
for all C® functions f. 

First it is clear that Zo f = Df. Now let us calculate 


“AZaf =(y*+D) (-y"f+Df)= 
=-yf+y'Df-y'Dft+ty?f+D'f=D’'f. 
Hence 


(28.12) VY Nee See aaa 8 i 
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for p = 0 or 1. We show by induction that this formula holds for all p. Since 


Zp Zp —2... Zp = Lp (Zp_2... 2912) Z_p = Zp D” "Z_» 

we only need to prove that 
(28.13) VA B las ee Baia 
in other words that 

(py + D) D?™ (py f + Df) = D?*"f. 
The left hand side being equal to 

(py™* + D) [D?f — pD?™ (y" f)] = DP fF — pD? (yf) + 
+ py DP f — py D> (yf) , 

the proof reduces to showing that 

D? (yf) =" D?f —py "DP (y"f) 5 


or, replacing f by yf, that D?(yf) = yD? f + pD?~'f, qed. 
In conclusion, the space HC;(f) of a holomorphic function f(z) is a simple 
p+ 1-dimensional g-module if and only if r = —p and 


(28.14) D?*? f(z) =0, 


in other words if and only if f is a polynomial of degree p (and not < p since 
Z f, # 0). The images L_p(g)f(z) = (cz + d)’ f(gz) are then polynomials of 
degree < p. 


(v) A theorem of Maaf. Applied to a holomorphic or even to a meromorphic 
form, formula (12) leads to the function 


DP f (2) = ZpZp-2--- Z-pf (2), 


proportional to fPFY(z). On the other hand, we know that, for every discrete 
subgroup I’ of G, the operator Z transforms generalized automorphic forms of 
weight r into generalized automorphic forms of weight r+ 2. So, for r = —p > 0, 
the operator D?*! transforms forms of weight —p into forms of weight p+2. As a 
result: 


Theorem 49 (Maa). If f is a generalized (resp. meromorphic) automorphic 
form of weight —r < 0 for a discrete subgroup I of G, then O°*'f/Az"*" is a 
generalized (resp. meromorphic) automorphic form of weight r +2 for I. 


The reader will check that with the strict definition of meromorphic forms 
(behaviour at parabolic fixed points) the theorem holds for Fuschian groups. 

Theorem 49 leads to other strange identities. For example let us take [ = G(Z) 
and 


f(z) = 1/A(z) = e(—z) + bo + bie(z) +... ; 


f is of weight —12, holomorphic on P and has a simple pole at infinity. The function 


F(z) = f(z) = (—2mi)e(—z) + (20i)8bie(z) +... 


498 XII — The Garden of Modular Delights 


is, therefore, a modular form of weight 14, holomorphic everywhere on P and having 
a simple pole at infinity. As a result, A(z) F(z) = (—2z7i)'3+4+... is an entire modular 
form of weight 14+ 12 = 26. Thus (n° 18, theorem 21) 


q'3 
dzi3 


A(z) A(z)~* = (—2mi)'3 Ex6(z) + cA(z) F14(z) 
with a constant c whose calculation we leave to the reader. 

The case of inverses of Eisenstein series can also be dealt with in principle. Let 
us for example choose f(z) = E,(z)~! = 1 — 240e(z) +... [n° 17, (iii)], whence 
F(z) = f®(z) = —240(2mi)°e(z) +.... This is a modular form of weight 6 whose 
only singularity, modI’, is at 7, where v;(F’) = —6 since v;(1/E1) = —1. The 
function F(z) FE4(z)°® is, therefore, an entire modular form of weight 30. As a result, 


@ B4(z)~ = aBao(2) + bA(2)Eisl2) + cA(2)?Ba(2) 


a (2) dz * 


with constants a, b, c that can be calculated explicitly by investigating the first few 
terms of the Fourier series of both sides. 


29 — Irreducible Representations of g 


(i) Classification. For all s € C, formulas 


Zem = (s +m/2)em+2 , 
Wem = ™Mem , 


(29.1) Zem = (s —m/2)em—2 


obtained above in (28.7) can be used to construct other g-modules different from 
the modules HC;(f) associated to holomorphic functions to P. To this end, we 
consider a vector space € admitting a basis (em), m € Z, and define W, Z and Z 
on € by these formulas. Trivial calculations show that indeed we get a g-module on 
which the Casimir operator is given by 


2Q=s(s—1). 


This g-module €, is not simple: the subspace E{ (resp. E; ) generated by vectors 
of even (resp. odd) weight is invariant. 

Since W is diagonalizable, every submodule of €7 is generated by the vectors 
€m it contains. If s ¢ Z, the coefficients s + m/2 of (1) are non-trivial, so that 
applying the monomials Z?Z" to some em for m even, we get all the en with n 


even. A similar conclusion holds for the vectors em, with m odd, if s — 4 ¢Z. Asa 
result, the g-module ES (resp. E; ) is simple if s (resp. s — 3) is not an integer. 

If s = r/2 for some even integer r, then Ze, = 0 and Ze_, = 0. The vectors 
€r+2m, m > 0, generate a submodule F' in Es isomorphic to the modules HC(f,) 
of the preceding n°. It is irreducible if and only r > 0. Similarly, the vectors e_,_2m, 
m > 0 generate a submodule F”, which is irreducible if and only if r > 0. In this 
case, the quotient module €}/(F' + F”) is irreducible and finite-dimensional. On 
the other hand for r < 0, the vectors e,,...,e—, generate an irreducible submodule, 


namely F’M Ff”. The same conclusions follow for €> if s = r/2 for an odd integer. 
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As will be seen without going into detailed easy calculations, all simple g- 
modules simples are obtained in this way provided they are required to be ad- 
missible: this means that W is diagonalization, with integer eigenvalues'*? and 
that, for all r, the subspace €(r) defined by Wf = rf is finite-dimensional. 

Indeed let € be such a g-module and let us suppose it is simple. A first result 
is that the Casimir operator is a scalar. Indeed let r be an eigenvalue of W and 
E(r) the space of solutions of Wa = ra. Since 2 commutes with W, it acts on E(r), 
hence has at least one eigenvector. Since 2 commutes with W, Z and Z, the set of 
vectors of € associated to a given eigenvalue of (2 is g-invariant, whence the result. 

This being settled, the formula 


Q=ZZ+4+W?/4-W/2=ZZ+W?/44+W/2 


shows that, for any eigenvector a of W, ZZa and ZZa are proportional to a. The 
subspace generated by vectors Z?a and Za is thus stable under Z and Z, hence 
invariant. So, choosing a non-trivial a; such that Wa, = ra, and setting 


wont 
Ar+2p = Zar, aAr—2q = Z ar, 


the vectors a, # 0 form a basis for €. Setting 2 = s(s—1) and using commutation 
formulas, 


(29.2) Zam = (s+m/2)(s —m/2—l)amsz2 if m<r, 
(29.2’) Zam = (s—m/2)(s+m/2—l)amz2 if m>r 


easily follow. If all the coefficients occurring in these formulas are non-trivial, which 
supposes that 2s is not an integer with the same parity as r, then the same is true 
for all vectors ar+2n and the module is simple. The only possibilities are easily seen 
to be as follows: 


a) m=...,—4,—2,0,2,4,... and non-integer s; 

b) m=...,—3,—-1,1,3,... and non-integer s — 5; 

c) m=r,r+2,...ands=r/2,r>1; 

d) m=-r,-r—2,...ands=-—r/2,r>1; 

e) m=-—p,—p+2,...,p—2,p et s = —p/2 or 1+ p/2. 


In cases (a) and (b), the vectors e, of (1) are obtained by setting e, = a, and 


29.3) €rton = (st+r/2)7*...(st+r/2+n—1) ‘aryan, 
29.3’) €r—on = (8 —7/2)*...(s —7r/2+n—1) ‘Gr4on 


for n > 1. In case (c), the second formula is not well-defined and the expected basis 
of € is provided by the first one, the second one being, however, suitable for case 
d). Note that the value of 2 is not sufficient for characterizing the representation 
and that, in cases (a) and (b), nothing is changed by replacing s by 1 — s. This is 
more obvious for relations (2) and (2’) than for formulas (1). 


123 This condition follows from the fact that, in any representation of the group G, 
characters of the compact subgroup K which occur are functions 


xr [exp(27itW)] = exp(2z7irt) 


with r € Z. In representations of the universal covering of G, where K is replaced 
by a group isomorphic to R, the eigenvalues of W associated to C™ vectors can be 
real numbers. The following calculations continue to hold with obvious changes. 
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Representations of type (a) (resp. (b)) constitute the even (resp. odd) principal 
series. Types (c) (resp. (d)) cover the holomorphic (resp. antiholomorphic) discrete 
series. Finally case (e) corresponds to finite-dimensional representations. 


(ii) Function space models for representations of g. Whether a group or a Lie 
algebra, one of the basic problems of representation theory consists in finding con- 
crete realizations. One always tries to construct function space models, where the 
space H. of the representation is composed of functions (possibly with values in 
Banach spaces) or even of distributions on G on which G acts by right (or left) 
translations in the case of G, by differential operators R(X) [or L(X)] in the case 
of g. 

The case of SLZ2(R) is particularly simple. Observe first that any function space 
model (on the right) of an irreducible representation of g has a base consisting of 
distributions y such that 


R(W)w=—-wxeW=rp, wx Q=s(s—1)p. 


Operators iR(M) are symmetric on the subspace D(G) of L7(G), for all M € G'(e). 
Hence R(M)* = —R(M) for all M € g. As aresult, 22 = —(ZZ*+Z*Z)+W*W/2. 
However, ZZ* + Z*Z + W*W/2 is positive-definite symmetric on D(G) and thus 
corresponds to a an elliptic differential operator on G with analytic coefficients. 
Since any eigenfunction or distribution 4p of W and 92 is clearly an eigenfunction 
of this operator, the theory of PDEs enables us to conclude that w is in fact an 
analytic function on G. With a slightly less simple argument, this result can be 
generalized to function space models of any admissible representation of g (or of 
the Lie algebra of a general semisimple group). 

The spaces HC(f,) associated to a holomorphic function on P are obviously 
function space models of representations of the holomorphic discrete series (c). 

To obtain series (d), it suffices to transform a model for series (c) by the map 
fro f.AsW =-—-W, relation f * W =—rf implies f x W =+4rf. 

Another method consists in transforming a right model for series (c) by f +> f. 
This gives a left model for series (d). Indeed W = —W = W, so that 


R(W)f =-f*W=rf— LW)f=W*f=(f*W)=-rf. 


For example let us choose the kernel function f = w, of the space H2(G) [n° 16, 
(iv), defined by (16.31)]. The g-module HC(w,.), i.e. the set of w,*4 where p € U(g), 
belongs to series (c). As @, = w,, the functions pz * w,, therefore, form a model for 
series (d). 

For finite-dimensional representations, we make G act on R? by (x,y) +> (av+ 
by, cx + dy) and use the obvious representation of g (or of G) on the space of 
homogeneous polynomials of a given degree. 

The construction of the function spaces H which are transformed in accordance 
with the principal series by operators R(M), M € g, is then immediate. In case (a), 
H. has a vector fo satisfying 


W fo =0, Qfo = s(s— 1)fo. 


In a function space model, fo is, therefore, the function on P of weight 0 corre- 
sponding to a function f(z) such that 


y’ Af = s(s—1)f. 
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To satisfy these conditions, for f(z) it suffices to choose a solution of this PDE, for 
example y° or y'~*, and to denote by fo the weight 0 function corresponding to f, 
for example 


fo(g) = Im(gi)* = a(g)°. 


We then obtain a basis for HC(fo) by applying the powers of the differential oper- 
ators Z and Z on fo, ie. by applying operators Zop...Z2Zo0 and Z2q ...Z220 on 
f(z). Formulas 


Zr f(z) = (ry +D) £2), Zrf(2) =" DF2) 
show that, for f(z) = y’*, 


(29.4) Zof(z)=sy*', ZeZof(z) =s(s+1)y*”’,... 
and that 
(29.4’) Zof(z) =sy*t', ZoZof(z) = s(st+1)y*"’,... 


On G, the left hand sides of (3) and (3’) correspond to functions 
Z” fo(g) = s(s +1)... (s+n—1(ci+d)~-*"\ci+ dj-75t?" (n> 0), 
Z" fo(g) = s(s +1)... (s—n+1)(cit+d)"|ci+d\->-2" (n> 0). 


of weight 2n and —2n. Formulas (4) and (4’) for r = 0 then show that the functions 
(29.5) €2n(g) = (ci +d)" |ci + d|-7°1?"— (n € Z) 
satisfy relations (1) for non-integer s € C. 


For fo(g) = Im(gi)*, the functions » € HC(fo) are obviously right K-finite and 
are transformed by the subgroup UH according to formula 


(29.6) g(uhg) = |t!*e(g) = a(h)*9(g) 
for u € U, h = diag(t,t~'), t # 0. Conversely, any K-finite solution of this func- 
tional equation satisfies y(—g) = y(g), hence is a linear combination of func- 


tions (5). The space of these functions is, therefore, a function space model for 
the even principal series of representations of g. 
The odd principal series is obtained likewise by replacing fo with the weight 1 


function corresponding to yoo 2, namely 


fig) = (ci +.d)7* Im(z)°>? = (ci +d) ei + d|?~?*. 
Then 


(29.7) y(uhg) = sgn(t)|t|"* (9) 


for all yp € H. 

There are also models composed of automorphic functions for a discrete group 
I having a parabolic cusp at infinity. In case (c), any (holomorphic) automorphic 
form of weight r is suitable. To realize case (a), it suffices to choose the Maaf8 series 
M(z; 8) associated to an arbitrary parabolic cusp of I’. As it is obtained by making 
a left translation of the above function a(g)° left [’-invariant and as the Lie algebra 
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acts on the right, the formulas are necessarily the same as before.!24 This supposes 
Re(s) > 1, but the analytic extension enables us to remove this restriction. From 
the point of view of representation theory, these properties of Maa& series justify a 
posteriori their introduction and allowed Langlands to generalize the theory to all 
semisimple groups. 

Lastly, let us note that, in Jacquet-Langlands theory,'?° Whittaker models 
composed of functions are used for the groups GL2 associated to algebraic num- 
ber fields over Q. Instead of (6), these functions satisfy the equality W(ugk) = 
e(u)W (g)x7r(k). They must be eigenfunctions of 2, (or of 2 for the corresponding 
functions on G) and satisfy “tempered increase” conditions at infinity. The Fourier 
series expansion of Maa series reveal their origin. Presenting the theory would take 
us too far. 


30 — Irreducible Representations of G 


To obtain representations of G corresponding to a simple g-module using right 
translations, we must renounce K-finite functions and choose Banach spaces of 
functions in which a model of the given g-module is everywhere dense. 


(i) The multiplicity one theorem. Let us first state an important result that was 
often announced: 


Theorem 50. For any irreducible unitary representation (H,7) of G and any 
character x of K, the subspace H(x) of solutions of 


has dimension < 1. 


Let us suppose that H(x) is non-trivial and let a 4 0 and b be two elements 
of this space. The representation being irreducible, b is the limit of vectors m(f)a 
where f € L(G). If E(x) = f 7(k)x(k)~‘dk is the orthogonal projection operator 
on H(x), «(f)a can be replaced by 


E(x) AEO)a =m (X* f*xX)a. 
If L(G; x) is the set of f € L(G) such that 


xX * f * xX = rie: i.e. f (k’ xk’) —'X (k’k’’) f(z) ’ 


and if, «(f) is also made to abusively denote the operator defined on H(x) byz(f), 
we get a homomorphism from the algebra L(G; y) equipped with the convolution 
product to the algebra of continuous operators on H(x). The previous argument 


124 Tn the case where I\G is compact, the representation of G on L?(I°\G) is a dis- 
crete direct sum of irreducible representations since convolutions by f € L(G) 
are Hilbert-Schmidt operators. They correspond either to holomorphic or anti- 
holomorphic forms, or to invariant solutions of Qf = s(1—s)f, which only exist 
for a countably many values of s and cannot be obtained using Maaf series. Note 
that, even when I’ has parabolic fixed points, Qf = s(1 — s)f may well have 
solutions in L?(I’\G) that cannot be obtained from Maaf series either. 

5 Automorphic Forms on GL(2) (Springer, 1972, Lecture Notes 278). See also 
R. Godement, Notes on Jacquet-Langlands theory (Institute for Advanced Study, 
Princeton, 1970) and Daniel Bump, Automorphic Forms and Representations 
(Cambridge UP, 1996), 2.8, much less systematic on this point. 


§9. SL2(R) as a Lie Group 503 


shows that the only closed subspace of H(x) invariant under the algebra A of 7(f) 
is the trivial one. 

But as 7 is assumed to be unitary, A is self-adjoint. Von Neumann’s density 
theorem [Chap. XI, n° 19, (vii)] then shows that A is everywhere dense in the 
algebra of continuous operators on H(x). Since the latter is commutative only if 
dim H(x) < 1, it will, therefore, suffice to show that L(G; x) is commutative with 
respect to the convolution product and, more generally (?), that this is the case of 
the algebra of continuous functions with compact support such that 


(30.1) f(vk) = f(ka). 


To prove the commutativity of Hecke operators, 
by 


a b = d b _ =. A _ 1 0 
(30.2) a (: a) = ¢ ‘) =wg w where w = ¢ a 


has already been used in n° 24, (iii). 

The operation transforming all functions f(x) into f[a(x)] reverses the order of 
every convolution product. Hence it is sufficient to show that f[o(x)] = f(a) for 
every solution of (1). To this end, let us write 2 = k/’hk” with k’,k” € K andh 
diagonal. Clearly, o(k) = k and o(h) = h7' for all k and h. Since K is commutative 
and contains w, by (1), 


126 the map 0: Gt> G given 


flo(a)] = f (Rh) = f (k"whw tk’) = 
= f (wok kwh) = f (kU Rh) = f (k’hK”) , 


ged.127 

The reader will probably think that there is a result similar to theorem 49 for 
non-unitary representations. This is correct if all continuous operators T on H are 
strong limits of operators m(f), f € L(G), i.e. if, for all finitely many n elements 
a; € H and r > 0, there exists f such that ||T’a; — m(f)ai|| <r for all i. For n = 1, 
this amounts to the non-existence of closed invariant subspaces. In finite dimension, 
this property suffices to show that any operator T is a 7(f) using a rather old result 


6 On functions invariant under the modular group, Hecke operators, which are 
sums of left translations, clearly commute with operators L(f) = [ L(g)f(g)dg. 
So the operators T(p) and L(f) forf(kx) = f(ak) generate a self-adjoint com- 
mutative algebra of continuous operators in L?(I’\G). Its discrete spectrum is 
of multiplicity 1. Jacquet-Langlands adelic theory explains this remarkable phe- 
nomenon far better than can be done here. 

A less miraculous proof of commutativity consists in checking the theorem for 
every finite-dimensional irreducible representation 7 [obvious: case (e) of the clas- 
sification], so that, for all functions (1), operators 7(f) commute pairwise. The 
result generalizes to finite-dimensional representations as they are direct sums 
of irreducible representations (true but not obvious). However, the coefficients 
of these representations form a function algebra which enables us to “separate” 
the points of G (obvious: it is the algebra of polynomial functions on G). Hence 
if, for some solution of (1), 7(f) = 0 for all finite-dimensional representations, 
then f = 0 (Stone-Weierstrass). As 7(g * g) = m(g * f), it follows that functions 
(1) commute, qed. This method generalizes to all semisimple linear groups; see 
R. Godement, A theory of spherical functions (Trans. AMS, 73, 1952, pp. 496— 
556). 


127 
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of Burnside. But, as far I know, there are no simple theorems covering the case of 
general Banach spaces. 

Following Harish-Chandra let us say that a representation (H,7) of G on a 
Banach space is admissible if the subspaces H(x) are finite-dimensional. Let Hx 
be their algebraic direct sum, everywhere dense in H. If (H, 7), not necessarily uni- 
tary, is topologically irreducible, the operators m(f) associated to functions (1) act 
on each H(x) in a topologically irreducible manner, hence algebraically irreducible 
since dim H(x) < +oo. As these operators commute, they have a common eigen- 
value in H(x), whence dimH(x) < 1. If (H,7) is only admissible, all a € Hx are 
analytic [n° 26, (v), theorem 43]. Thus there is a representation of g on this sub- 
space. For alla € Hx, the G-invariant closed subspace generated by a is, therefore, 
the closure of the g-invariant subspace generated by a. As every G-invariant closed 
subspace € is the closure of its intersection with Hx, which is g-invariant, the cor- 
respondence between G-invariant closed subspaces € and g-invariant subspaces of 
Hx must be bijective. Consequently, the representation of G on H is topologically 
irreducible (no closed invariant subspaces) if and only if the representation of g 
on Hx is algebraically irreducible. This result was obtained very early on for all 
semisimple groups by Harish-Chandra. As Hx is obviously generated by eigenvec- 
tors of 7(k) and hence of 7(W), the representation of g on Hx belongs to one of 
the types obtained above. 


(ii) Function space models for G: the discrete series. Consider the holomorphic 
discrete series. Supposing r > 2, theorem 27 of n° 20 shows that, for all y € H2(G), 
the representation of G on the right invariant closed subspace H(p) generated 
by y in L?(G) is unitary and irreducible. Choosing y = w,, w,r(kg) = w,(gk) = 
xr(k)wr(g) and, in conformity with the previous theorem, w, is the only f € H(y) 
with this property, up to a constant factor. HC(y) is everywhere dense in H(y) 
and it is the subspace H(y) x. 

If, instead of considering the closed subspace generated by all R(g)w,, we con- 
sider the subspace generated by all L(g)w,, we get H2(G) and the irreducible uni- 
tary representation of G on this space described in n° 16. H?(G) is therefore the 
closure in L?(G) of the subspace of L(j:)wy = pb * wr,  € U(g), which plays the 
role of HC(w,) for left translations. It is, however, the image of HC(w,) under 


f > f. In consequence, the subspaces H(w,) and H?(G) are mapped to each 


other by f + f or, equivalently, the representation of g on the K-finite vectors of 
the representation (H2(G), L) of G belongs to the discrete series (d). 

Condition r > 1 is compulsory because of theorem 14 of n° 16, (ii). If r = 1, the 
space H = H{(G) is non-trivial for 2 < p < +00, and as the spaces H(x) obviously 
have dimension 1, it is easy to see that all topologically irreducible representations 
of G corresponding to the representation r = 1 of the holomorphic discrete series 
are obtained in this way. For p = +ov, in the half-plane, as already seen at the 
start of n° 16, the space obtained corresponds to the set of holomorphic functions 
such that 


sup |y? f(2)| < too. 


But it is not a Hilbert space. To obtain one, we build on theorem 15 of n° 16: for 
r > 2, the complex Fourier transform shows that H?(P) is isomorphic to the L? 
subspace of Ri. with measure t!-"dt. Now, the complex Fourier transform applies 
equally for r = 1 since, for Im(z) > 0, the function e(tz) decreases rapidly to +00. 
For any function ¢(t),t > 0, satisfying f |y(t)|?dt < +00, the function 


(30.3) f(iz)= [ eette)at = few exp(—27ty)e(ta)dt 
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satisfies 
i Fle-+ iy)? de = / Ip(é)|? exp(—4aty)dt < / p(t)? at. 


So the functions obtained satisfy 
(30.4) sup | Fle + wy)? de = |lel2 < +00. 
y 


The converse is “the” classical Paley-Wiener theorem proved in n° 16. Hence it is 
possible to conjecture that, for r = 1, the discrete series (c) can be realized on this 
space of holomorphic functions using operators 


(30.5) Li (g7*) : f(z) 9 (cz +d) * f (gz) 


of n° 15, (ii). Condition (4) is obviously preserved if g € B. To justify the construc- 
tion, it is also necessary to check the theorem if g = w. I leave this “detail” to the 
reader’s meditations; see exercise 4 of n° 16, (v). 

Note also that the left hand side of (4) is in fact the limit of the integral as y 
tends to 0, i.e. (Plancherel) the norm on L?(R) of the function 


f(x) = lim.’ f(a + ty) = G(-2), 


which is well-defined. This indeed gives a unitary representation U of G on L?(R) 
by setting, as in (5), 


U (g"*) f(x) = (cx +d)" f [(ax + 6)/(cx + d)] . 


That these operators preserve the subspace of functions whose Fourier transforms 
are zero for t < 0 remains to be checked. All this is related to another function 
space model for the odd principal series for Re(s) = a obtained using operators 


(30.5’) Us (g-*) f(x) = (ca + d)~'|ca + d\°-? f [(ax + b)/(cx + d)] . 


(iii) Function space models for G: the even principal series. Let us consider the 
set V*(s) of solutions of 


f(uhg) = ath)’ f(g) 


which, instead of being K-finite, are continuous and let us make G act by right 
translations R(g). Setting as above fo(g) = a(g)*, the function space model HC( fo) 
for the representation of g is obviously everywhere dense in V‘(s) (expand as a 
Fourier series on K’), and is the set of K-finite vectors of the representation R 
of G on Yt(s). It is therefore topologically irreducible if HC(fo) is algebraically 
irreducible, i.e. if s ¢ Z. 

As replacing s by 1 — s is known to leave the representation invariant, there 
must be an isomorphism from V*(s) onto V*(1—s) commuting with all R(g). For 
Re(s) > 4, this is the intertwining operator'** M(s) which, associates the function 


128 See some calculations in Daniel Bump, Automorphic Forms and Representations 
(Cambridge UP, 1998), pp. 227-232. All this theory has been generalized, to start 
with by A. Knapp and E. M. Stein, Intertwining operators for semi-simple groups 
(Ann. Math., 93, 1971, pp. 489-578). 
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(30.6) M(s)e(9) = f elwug)au 


to every y € V*(s). Formally it is in Vt (1—s). To check that the integral converges, 
note that the weight 0 function a(g)* = Im(z)* is in VT(s). As y(g)/a(g)* is 
continuous and B-invariant, there is a uniform upper bound |y(g)| < Mla(g)*| = 
M|Im(z)°|. Then a simple calculation enables us to deduce that 


(30.7) |p(wug)| < M |a(wug)*| =O (\u|~?*) for large u. 


As the sign O is even an asymptotic equivalence if y is never zero, we must have 
Re(s) > 4. Finally, (7) shows that integral (6) converges normally. So, like ¢, it is 
a continuous function of g, whence M(s)y € V*(1— s) for Re(s) > 4. 

If y is K-finite, for example if 


(30.8) p(uhk) = a(h)°xr(k) = Yr,s(g) 5 


then M(s)yr,s(g) = cr(s)%r,1-s(g), where the function c,(s) is meromorphic on C 
and can be computed using the Euler I’ function. The intertwining operator can 
then be defined by analytic extension and it can be checked to be bijective at least 
on K-finite vectors if s ¢ Z. Without any calculation, M(1— s)M(s) can even be 
presumed to reduce to a scalar ...a computable one. We could make exercises out 
of this, for example: 

Exercise 1. Set 


o() = f lg(e)a lee, 


where ® € S(R*) is the function 


P(x, y) = (a + iy)” exp [—r (a? + y’)] : 


Calculate y, M(s)y and M(1— s)M(s). 

Let us now indicate how to determine those, among these models for the prin- 
cipal series, that could in some way be unitary. K-finite vectors of V‘(s) form a 
model for the series (a) of irreducible representations of g. Hence, on this space, 
there must exist a positive Hermitian form ( | ) with respect to which functions of 
weight r are pairwise orthogonal and operators it(X) symmetric for all X € G’(e), 
which amounts to saying that 7(Z) is the adjoint of 7(Z). Setting (e-|e-) = cr in 
the notation of (29.1), 


(s+r/2—1)c = —(8— 1/2) cr—2 


and as it is necessary to be able to choose c, > 0, we must have 
(s+r/2—1)(s—r/2) <0 


for all r occurring in the representation. For the even principal series and s ¢ Z, 
this is equivalent to 


Re(s) == or s€]0,1[. 


For the odd principal series and s — s ¢ Z, the condition is the same. In all cases, 
the expected Hermitian form is unique up to a constant factor. But the essential 
point is calculating the scalar products on the corresponding spaces V*(s). Let us 
do it for the even principal series. 
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First observe that, for y € Vt(s) and w € Vt (1—s), yy € V*(1), the space of 
solutions of 


y(bg) = a(b)y(g) where a(huk) = a(h). 


On the other hand, every y € Y*(s) is clearly of the form 
ota) = | Fog)alo)*arb, 


where f € L(G). Let us now show that’? there exists a right invariant positive 
linear functional u on V*(1). The B-invariant measures are given by 


b=uh=>d,b=dudh, b= hu => djb=dhdu= ah) ‘d,b, 


whence d,b = a(b)d;b. Then let us associate the functions 
F(a) = | fa)a, 
(30.9) fala) = [ Hog )ab= f Hlog)a(o)- Mayo 


to every f € L(G). The former one is in Y*(0) and the latter in V*(1). However, 
an immediate calculation shows that 


if 2 (9)F(g)dg = i pla) fa(a)dg 


for all f,p € L(G). As p can be chosen so that p? = 1, fg = 0 implies f f(g)dg = 0. 
The expected linear functional py is then given by 


(30.10) wife) = | fa)dg for all f € L(G) 


in conformity with Chap. XI. Note that (iy) is well-defined if gy € Vt(1), but not 
if py € Vt(0). 
To calculate 4 more explicitly, we write that G = Bk, whence 


(30.11) w(t) = f sda = ff tom \dibar = ff fo(wyar. 


One can also write G = BwU up to a null set. As U is unimodular, there is a left 
invariant measure on G/U, which can only be d,b. Hence 


(30.11’) (fa) = Jf feowward a | fo(wudu = M(1)fa(e) 
up to a constant factor. This is an integral of type (6) with s = 1, hence converges 


if f is continuous. Besides, fe(g) = O[Im(z)], and so fa(wu) = O(u7?) at infinity. 
Then for y € Y*(s) and ~ € Vt (1 —), let us set 


(30.12) (ev) =alev) = f ede = [ plwuyb(wuydu. 


We thereby get an invariant bilinear form under right translations. Similarly, setting 


29 Here, I resume Chap. XI, n° 15, (iv). 
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(30.13) (ple) = « (v9) 


for p € V*(s) and w € Yt(1—8), we get an invariant sesquilinear form. For 
Re(s) = 4, 1-5 = s, whence an invariant positive-definite Hermitian form on V*(s). 
As expected, the representations of the even principal series are therefore unitary 
for Re(s) = 3. To obtain a Hilbert model, we replace the continuity condition 


imposed for convenience on f € ate ) by the condition that y be square integrable 
with respect to p, i.e. that f |p(k)|>dk < +00 or f |p(wu)|?du < +00. 
The case s € ]0,1[ is hard to tackle but more instructive. As we are in the 


previous case if s = 5, we may suppose that s > , because of the symmetry 


s <> 1—s. The intertwining operator M(s) maps V*(s) to V*(1 — s), which, for 
y,v € V*(s), makes the expression 


(y, ¥) = w[e.M(s) a= [foe w(k)dk = 


(30.14) = ae ie 


well-defined. It is an invariant bilinear form on V*(s), whence an invariant sesquilin- 
ear form 


(eb) = (elY) » 


which is Hermitian if s is real. Let us show that it is positive-definite if 4 <s<l. 
Since ~ is even, 


woven foo GDH flO) GDI 


On the other hand, obviously 


-1 -1 -1 
€ , = (5 e(4 i ) = bw ( a ) with a(b) =v”. 


Hence, identifying v € R with the matrix with parameter v in U, 


moyen) = fv fo (4 ty) forte = 


“fobeG “ee 


(p,) = // y(wu)b(wou)|v|???7dudv 


for Re(s) > 3 The double integral converges since Lebesgue-Fubini reduces it to 
the latter integral in (14). Hence, if 4 <s <1, 


(yly) = [fv wu)y(wuv)|v|?°?dudv = [feo P(u + v)\v/?°~?dudu = 


= fe « B(v) |u|? dv. 


and so 


§9. SL2(R) as a Lie Group 509 


The function ®(u) = y(wu) is continuous on R, bounded, and because of (7), in- 
tegrable. The function |v|?*~? is locally integrable. Finally, as mentioned above, 
the latter integral converges. To prove the positivity of this expression, it is then 
sufficient to show that the measure |v|”*~*dv is of positive type on R. This is indeed 
the definition for ® € L(R) and the expected result will be obtained by approxi- 
mating ® by functions ®, € L(R) bounded above by |®| (dominated convergence). 
Showing that |v|?°~?du = |v|?*~1d*v is the Fourier transform of a positive measure 
is therefore sufficient [Chap. XI, n° 30]. 

We have therefore returned to the entrance of the Garden of modular delights, 
which will allow the author to come out of it. If the Mellin transform 


Z / f(e)|el*d"x = i fle)due(e), Re(s) > 0 


is associated to every reasonable function f, then, by (1.8), 


n ff Flea a) = (2n)'* cos(ns/2)2(8) f F(e)dun—s() 


for 0 < Re(s) < 1. This means precisely that the Fourier transform of dys(x) 
is m~'(2m)'~* cos(1s/2)I'(s)dpi—s(x). For the measure |v|?*~?dv that concerns us 
here, the parameter is 2s — 1. To conclude, I leave it to the reader to prove that 


p< 8 <1 0<25-1<1 & cos[m(2s— 1)/2] > 0 


End 
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don’t make editors like that anymore at a time when financiers govern everything 
— and my admiration for the forbearance shown me by Catriona Byrne: at the 
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imagine it either. All mathematicians know what they owe her. 

Without knowing any French, Uwe Matrisch created the typography in Leipzig 
from a NotaBene version encoded in my own way in order to somewhat simplify 
the transcription into TeX. M. Matrisch dealt with countless corrected proofs and 
delivery delays of various parts of the manuscript without ever complaining. May 
he too be thanked. 
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